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PROTEIN MODIFICATION AND MAINTENANCE MOLECULES 

TECHNICAL FIELD 

The invention relates to novel nucleic acids, protein modification and maintenance molecules 
5 encoded by these nucleic acids, and to the use of these nucleic acids and proteins in the diagnosis, 
treatment, and prevention of gastrointestinal, cardiovascular, autoimmune/inflanimatory, cell 
proliferative, developmental, epithelial, neurological, and reproductive disorders. The invention also 
relates to the assessment of the effects of exogenous compounds on the expression of nucleic acids 
and protein modification and maintenance molecules. 

10 

BACKGROUND OF THE INVENTION 
The cellular processes regulating modification and maintenance of protein molecules 
coorditiate their function, conformation, stabilization, and degradation. Each of these processes is 
mediated by key enzymes or proteins such as kinases, phosphatases, proteases, protease inhibitors, 
IS isomerases, transferases, and molecular chaperones. 
Kinases 

Kinases catalyze the transfer of high-energy phosphate groups from adenosine triphosphate 
(ATP) to target proteins on the hydroxyamino acid residues serine, threonine, or tyrosine. Addition 
of a phosphate group alters the local charge on the acceptor molecule, causing internal 

20 conformational changes and potentiaUy influencing intermolecular contacts. Reversible protein 
phosphorylation is the ubiquitous strategy used to control many of the intracellular events in 
eukaryotic cells. It is esthnated that more than ten percent of proteins active in a typical mammalian 
cell are phosphorylated. Extracellular signals including hormones, neurotransmitters, and growth and 
differentiation factor can activate kinases, which can occur as cell surface receptors or as the activator 

25 of the final effector protein, but can also occur along the signal transduction pathway. Kinases are 
involved in all aspects of a cell's function, from basic metaboHc processes, such as glycolysis, to cell- 
cycle regulation, differentiation, and communication with the extracellular environment through 
signal transduction cascades. Inappropriate phosphorylation of proteins m cells has been linked to 
changes in cell cycle progression and cell differentiation. Changes in the cell cycle have been linked 

30 to induction of apoptosis or cancer. Changes in cell differentiation have been linked to diseases and 
disorders of the reproductive system, inmiune system, and skeletal muscle. 

There are two classes of protein kinases. One class, protein tyrosine kinases (PTKs), 
phosphorylates tyrosine residues, and the other class, protein serine/threonme kinases (STKs), 
phosphoiylates serine and threonine residues. Some PTKs and STKs possess structural 

35 characteristics of both families and have dual specificity for both tyrosine and serine/threonine 
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residues. Almost all kinases contain a conserved 250-300 antiino acid catalytic domain containing 
specific residues and sequence motifs characteristic of the kinase family. (Reviewed in Hardie, G. 
and Hanks, S. (1995) The Protein Kinase Facts Book, Vol I p.p. 17-20 Academic Press, San Diego, 
CA.). 
5 Phosphatases 

Phosphatases hydrolytically remove phosphate groups from proteins. Phosphatases are 
essential in determimng the extent of phosphorylation in the cell and, together with kinases, regulate 
key cellular processes such as metabolic enzyme activity, proliferation, cell growth and 
differentiation, cell adhesion, and cell cycle progression. Protein phosphatases are characterized as 

10 either serine/threonine- or tyrosine-specific based on iheir preferred phospho-amino acid substrate. 
Some phosphatases (DSPs, for dual specificity phosphatases) can act on phosphorylated tyrosine, 
serine, or threonine residues. The protein serine/threonine phosphatases (PSPs) are in^)ortant 
regulators of many cAMP-mediated hormone responses m cells. Protein tyrosine phosphatases 
(PTPs) play a significant role in cell cycle and cell signaling processes. 

15 Proteases 

Proteases cleave protems and peptides at the peptide bond that forms the backbone of the 
protein or peptide chain. Proteolysis is one of the most important and frequent enzymatic reactions 
that occurs both within and outside of cells. Proteolysis is responsible for the activation and 
maturation of nascent polypeptides, the degradation of misfolded and damaged proteins, and the 

20 controlled turnover of peptides within the cell. Proteases participate in digestion, endocrine function, 
and tissue remodeling during embryonic development, wound healing, and normal growth. Proteases 
can play a role in regulatory processes by affecting the half life of regulatory proteins. Proteases are 
involved in the etiology or progression of disease states such as inflammation, angiogenesis, tumor 
dispersion and metastasis, cardiovascular disease, neurological disease, and bacterial, parasitic, and 

25 viral infections. 

Proteases can be categorized on the basis of where they cleave their substrates. 
Exopeptidases, which include aminopeptidases, dipeptidyl peptidases, tripeptidases, 
carboxypeptidases, peptidyl-di-peptidases, dipeptidases, and omega peptidases, cleave residues at the 
termini of their substrates. Endopeptidases, includmg serine proteases, cysteine proteases, and 

30 metalloproteases, cleave at residues within the peptide. Four principal categories of mammalian 

proteases have been identified based on active site structure, mechanism of action, and overall three- 
dimensional structure. (See Beynon, R.J. and J.S. Bond (1994) Proteolytic Enzvmes: A Practical 
Approach, Oxford University Press, New York, NY, pp. 1-5.) 
Serine Proteases 

35 The serine proteases (SPs) are a large, widespread family of proteolytic enzymes that include 
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the digestive enzymes trypsin and chymotrypsin, components of the complement and blood-clotting 
cascades, and enzymes that control the degradation and tumover of macromolecules within the cell 
and in the extracellular matrix. Most of the more than 20 subfamilies can be grouped into six clans, 
each with a common ancestor. These six clans are hypothesized to have descended from at least four 
5 evolutionarily distinct ancestors. SPs are named for the presence of a serine residue found in the 
active catalytic site of most families. The active site is defined by the catalytic triad, a set of 
conserved asparagine, histidine, and serine residues critical for catalysis. These residues fonn a 
charge relay network that facilitates substrate binding. Other residues outside the active site form an 
oxyanion hole that stabilizes the tetrahedral transition intermediate formed during catalysis. SPs have 
10 a wide range of substrates and can be subdivided into subfamilies on the basis of their substrate 
specificity. The main subfamilies are named for the residue(s) after which they cleave: trypases 
(after arginine or lysine), aspases (after aspartate), chymases (after phenylalanine or leucine), metases 
(methionine), and serases (after serine) (Rawlmgs, N.D. and AJ. Barrett (1994) Meth. Enz. 244:19- 
61). 

15 Most mammalian serine proteases are synthesized as zymogens, inactive precursors that are 

activated by proteolysis. For example, trypsinogen is converted to its active form, trypsin, by 
enteropeptidase. Enteropeptidase is an intestinal protease that removes an N-terminal fi*agment fi-om 
trypsinogen. The remaining active fragment is trypsin, which in turn activates the precursors of the 
other paiicreatic enzymes. Likewise, proteolysis of prothrombin, the precursor of thrombin, generates 

20 three separate polypeptide fragments. The N-tenninal fragment is released while the other two 
fragments, which comprise active thrombin, remain associated through disulfide bonds. 

The two largest SP subfamilies are the chymotrypsin (SI) and subtilisin (S8) families. Some 
members of the chymotrypsin family contain two structural domains unique to this family. Kringle 
domains are triple-looped, disulfide cross-linked domains found in varying copy number. Kringles 

25 are thought to play a role in binding mediators such as membranes, other proteins or phospholipids, 
and in the regulation of proteolytic activity (PROSTTE PDOC00020). Apple domains are 90 amino- 
acid repeated domains, each containing six conserved cysteines. Three disulfide bonds link the first 
and sixth, second and fifth, and third and fourth cysteines (PROSITE PDOC00376). Apple domains 
are involved in protein-protein interactions. SI fanaily members include trypsin, chymotrypsin, 

30 coagulation factors K-Xn, complement factors B, C, and D, granzymes, kallikrein, and tissue- and 
urokinase-plasminogen activators. The subtilisin family has members found in the eubacteria, 
archaebacteria, eukaryotes, and viruses. Subtilisins include the proprotein-processing endopeptidases 
kexin and furin and the pituitary prohormone convertases PCI, PC2, PCS, PC6, and PACE4 
(Rawlings and Barrett, supra ). 

35 SPs have functions in many normal processes and some have been implicated in the etiology 
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or treatment of disease. Enterokinase, the initiator of intestinal digestion, is found in the intestinal 
brush border, where it cleaves the acidic propeptide from trypsinogen to yield active trypsin 
(Kitamoto, Y. et al. (1994) Proc. Natl. Acad. Sci. USA 91: 7588-7592). Prolylcarboxypeptidase, a 
lysosomal serine peptidase that cleaves peptides such as angiotensin n and III and [des-Arg9] 
5 bradykinin, shares sequence homology with members of both the serine carboxypeptidase and 

prolylendopeptidase families (Tan, F. et al. (1993) J. Biol. Chem. 268:16631-16638), The protease 
neuropsin may influence synapse formation and neuronal connectivity in the hippocampus in 
response to neural signaling (Chen, Z.-L. et al. (1995) J Neurosci 15:5088-5097). Tissue 
plasminogen activator is useful for acute management of stroke (Zivin, J.A. (1999) Neurology 53:14- 

10 9) and myocardial infarction (Ross, A.M. (1999) Clin Cardiol 22: 165-71). Some receptors (PAR, for 
proteinase-activated receptor), highly expressed throughout the digestive tract, are activated by 
proteolytic cleavage of an extracellular domain. The major agonists for PARs, thrombin, trypsin, and 
mast cell tryptase, are released in allergy and inflammatory conditions. Control of PAR activation by 
proteases has been suggested as a promising therapeutic target (VergnoUe, N. (2000) Aliment. 

15 Pharmacol. Ther. 14:257-266; Rice, K.D. et al. (1998) Curr. PharnL Des. 4:381-396). Prostate- 
specific antigen (PSA) is a kallikrein-like serine protease synthesized and secreted exclusively by 
epithelial cells in the prostate gland. Serum PSA is elevated in prostate cancer and is the most 
sensitive physiological marker for monitoring cancer progression and response to therapy. PSA can 
also identify the prostate as the origin of a metastatic tumor. (Brawer, M. K. and Lange, P. H. (1989) 

20 Urology 33:1 1-16). 

The kallikreins are a subfamily of serine proteases. KLK14 is a kallikrein gene located within 
the human kallikrein locus at 19ql3.4. KLK14 is approximately 5.4 kb in length and transcribes two 
alternative transcripts present only in prostate and skeletal muscle. In prostate, KLK14 is expressed 
by both benign and malignant glandular epithelial cells, thus exhibiting an expression pattern similar 

25 to that of two other prostatic kallikreins, KLK2 and KLK3, which encode K2 and prostate-specific 
antigen, respectively (Hooper, J.D. et al. (2001) Genomics 73:117-122). 

The signal peptidase is a specialized class of SP found in all prokaryotic and eukaryotic cell 
types that serves in the processing of signal peptides from certain proteins. Signal peptides are 
amino-terminal domains of a protein which direct the protein from its ribosomal assembly site to a 

30 particular cellular or extracellular location. Once the protein has been exported, removal of the signal 
sequence by a signal peptidase and posttranslational processing, e.g., glycosylation or 
phosphorylation, activate the protein. Signal peptidases exist as multi-subunit complexes in both 
yeast and mammals. The canine signal peptidase complex is composed of five subunits, all 
associated with the microsomal membrane and containing hydrophobic regions that span the 

35 membrane one or more times (Shehiess, G.S. and G. Blobel (1990) J. Biol. Chem. 265:9512-9519). 
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Some of these subunits serve to fix the complex ui its proper position on the membrane while others 
contain the actual catalytic activity. 

Another family of proteases which have a serine in their active site are dependent on the 
hydrolysis of ATP for their activity. These proteases contain proteolytic core domains and regulatory 

5 ATPase domains which can be identiiSed by the presence of the P-Ioop, an ATP/GTP-binding motif 
(PROSrrE PDOC00803). Members of this family include the eukaryotic mitochondrial matrix 
proteases, Clp protease and the proteasome. Clp protease was originally found in plant chloroplasts 
but is believed to be widespread in both prokaryotic and eukaryotic cells. The gene for early-onset 
torsion dystonia encodes a protein related to Clp protease (Ozelius, LJ. et al. (1998) Adv. Neurol. 

10 78:93-105). 

The proteasome is an intracellular protease complex found in some bacteria and in all 
eukaryotic cells, and plays an important role in cellular physiology. Proteasomes are associated with 
the ubiquitin conjugation system (UCS), a major pathway for the degradation of cellular protems of 
all types, including proteins that function to activate or repress cellular processes such as transcription 

15 and cell cycle progression (Ciechanover, A. (1994) Cell 79: 13-21). In the UCS pathway, proteins 
targeted for degradation are conjugated to ubiquitin, a small heat stable protein. The ubiquitinated 
protein is then recognized and degraded by the proteasome. The resultant ubiquitin-peptide complex 
is hydrolyzed by a ubiquitin carboxyl termmal hydrolase, and free ubiquitin is released for 
reutilization by the UCS. Ubiquitin-proteasome systems are implicated in the degradation of mitotic 

20 cyclic kinases, oncoproteins, tumor suppressor genes (p53), cell surface receptors associated with 

signal transduction, transcriptional regulators, and mutated or damaged proteins (Ciechanover, supra\ :r. 
This pathway has been implicated in a number of diseases, including cystic fibrosis, Angelman's 
syndrome, and Liddle syndrome (reviewed in Schwartz, A.L. and A. Ciechanover (1999) Ann. Rev. 
Med. 50:57-74). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin protease whose 

25 overexpression leads to oncogenic transformation of NIH3T3 cells. The human homologue of this 
gene is consistently elevated in small cell tumors and adenocarcinomas of the lung (Gray, D.A. 

(1995) Oncogene 10:2179-2183). Ubiquitin carboxyl temunal hydrolase is involved in the 
differentiation of a lymphoblastic leukenda cell Ime to a non-dividing mature state (Maki, A. et al. 

(1996) Differentiation 60:59-66). In neurons, ubiquitin carboxyl terminal hydrolase (PGP 9.5) 
30 expression is strong in the abnormal structures that occur in human neurodegenerative diseases 

(Lowe, J. et al. (1990) J. Pathol. 161:153-160). The proteasome is a large (--2000 kDa) multisubunit 
complex composed of a central catalytic core containing a variety of proteases arranged in four seven- 
membered rings with the active sites facing inwards into the central cavity, and terminal ATPase 
subunits covering the outer port of the cavity and regulating substrate entry (for review, see Schmidt, 
35 M. et al. (1999) Curr. Op. Chem. Biol. 3:584-591). 
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Cysteine Proteases 

Cysteine proteases (CPs) are involved in diverse cellular processes ranging from the 
processing of precursor proteins to intracellular degradation. Nearly half of the CPs known are 
present only in viruses. CPs have a cysteine as the major catalytic residue at the active site where 
5 catalysis proceeds via a thioester intermediate and is facilitated by nearby histidine and asparagine 
residues. A glutamine residue is also important, as it helps to form an oxyanion hole. Two important 
CP families include the papain-like enzymes (CI) and the calpains (C2). Papain-like family members 
are generally lysosomal or secreted and therefore are synthesized with signal peptides as well as 
propeptides. Most members bear a conserved motif in the propeptide that may have structural 
10 significance (Karrer, K.M. et al. (1993) Proc. Natl. Acad. Sci. USA 90:3063-3067). Three- 
dimensional structures of papain family members show a bilobed molecule with the catalytic site 
located between the two lobes. Papains include cathepsins B, C, H, L, and S, certain plant allergens 
and dipeptidyl peptidase (for a review, see Rawlings, N.D. and A. J. Barrett (1994) Meth. Enz. 
244:461-486). 

15 Some CPs are expressed ubiquitously, while others are produced only by cells of the immune 

system. Of particular note, CPs are produced by monocytes, macrophages and other cells which 
migrate to sites of inflammation and secrete molecules involved in tissue repair. Overabundance of 
these repair molecules plays a role in certain disorders. In autoimmune diseases such as rheumatoid 
arthritis, secretion of the cysteine peptidase cathepsin C degrades collagen, laminin, elastin and other 

20 stmctural proteins found in the extracellular matrix of bones. Bone weakened by such degradation is 
also more susceptible to tumor invasion and metastasis. Cathepsin L expression may also contribute 
to the influx of mononuclear cells which exacerbates the destruction of the rheumatoid synovium 
(Keyszer, G.M. (1995) Arthritis Rheum. 38:976-984). 

Calpains are calcium-dependent cytosolic endopeptidases which contain both an N-terminal 

25 catalytic domain and a C-terminal calcium-binding domain. Calpain is expressed as a proenzyme 

heterodinoer consisting of a catalytic subunit unique to each isoform and a regulatory subunit common 
to different isoforms. Each subunit bears a calcium-binding EF-hand domain. The regulatory subunit 
also contains a hydrophobic glycine-rich domain that allows the enzyme to associate with cell 
membranes. Calpains are activated by increased intracellular calcium concentration, which induces a 

30 change in conformation and limited autolysis. The resultant active molecule requires a lower calcium 
concentration for its activity (Chan S.L. and Mattson M.P. (1999) J. Neurosci. Res. 58:167-190). 
Calpain expression is predominantly neuronal, although it is present in other tissues. Several chronic 
neurodegenerative disorders, including ALS, Parkinson's disease and Alzheimer's disease are 
associated with increased calpain expression (Chan and Mattson, supra) . Calpain-mediated 

35 breakdown of the cytoskeleton has been proposed to contribute to brain damage resulting from head 
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injury (McCracken E. et al. (1999) J. Neurotrauma 16:749-61). Calpain-3 is predominantly expressed 
in skeletal muscle, and is responsible for limb-girdle muscular dystrophy type 2A (Minami, N. et al 
(1999) J. NeuroL Sci. 171:3 1--37). 

Another family of thiol proteases is the caspases, which are involved in the initiation and 
5 execution phases of apoptosis. A pro-apoptotic signal can activate initiator caspases tiiat trigger a 
proteolytic caspase cascade, leading to the hydrolysis of target proteins and the classic apoptotic 
death of the cell. Two active site residues, a cysteine and a histidine, have been implicated in the 
catalytic mechanism. Caspases are among the most specific endopeptidases, cleaving after aspartate 
residues. Caspases are synthesized as inactive zymogens consisting of one large (p20) and one small 

10 (plO) subunit separated by a small spacer region, and a variable N-terminal prodomain. This 

prodomain interacts with cofactors that can positively or negatively affect apoptosis. An activating 
signal causes autoproteolytic cleavage of a specific aspartate residue (D297 in the caspase-1 
numbering convention) and removal of the spacer and prodomain, leaving a pl0/p20 heterodimer. 
Two of these heterodimers interact via their small subunits to form the catalytically active tetramer. 

15 The long prbdomains of some caspase family members have been shown to promote dimerization and 
auto-processing of procaspases. Some caspases contain a "death effector domain" in their prodomain 
by which they can be recruited into self-activating conq)lexes with other caspases and FADD protein 
associated death receptors or the TNF receptor complex. In addition, two dimers from different 
caspase family members can associate, changing the substrate specificity of the resultant tetramer. 

20 Endogenous caspase inhibitors (inhibitor of apoptosis proteins, or lAPs) also exist. All these 
interactions have clear effects on the control of apoptosis (reviewed in Chan and Mattson, supra : 
Salveson, G.S. and V.M. Dixit (1999) Proc. Nat. Acad. Sci. USA 96: 10964-10967). 

Caspases have been implicated in a number of diseases. Mice lacking some caspases have 
severe nervous system defects due to failed apoptosis in the neuroepithelium and suffer early 

25 lethality. Others show severe defects in the inflammatory response, as caspases are responsible for 
processing IL-lb and possibly other inflammatory cytokines (Chan and Mattson, supra) . Cowpox 
virus and baculoviruses target caspases to avoid the death of their host cell and promote successful 
, infection. In addition, increases in inappropriate apoptosis have been reported in AIDS, 

neurodegenerative diseases and ischemic injury, while a decrease in cell death is associated with 

30 cancer (Salveson and Dixit, supra : Thompson, C.B. (1995) Science 267: 1456-1462). 
Aspartvl proteases 

Aspartyl proteases (APs) include the lysosomal proteases cathepsins D and E, as well as 
chymosin, renin, and the gastric pepsins. Most retroviruses encode an AP, usually as part of the pol 
polyprotein. APs, also called acid proteases, are monomeric enzymes consisting of two domains, 
35 each domain containing one half of the active site with its own catalytic aspartic acid residue, APs 
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are most active in the range of pH 2-3, at which one of the aspartate residues is ionized and the other 
neutral. The pepsin family of APs contains many secreted enzymes, and all are likely to be 
synthesized with signal peptides and propeptides. Most family members have three disulfide loops, 
the fost -5 residue loop following the first aspartate, the second 5-6 residue loop preceding the 
5 second aspartate, and the third and largest loop occurring toward the C terminus, Retropepsins, on 
the other hand, are analogous to a single domain of pepsin, and become active as homodimers with 
each retropepsin monomer contributing one half of the active site. Retropepsins are required for 
processing the viral polyproteins. 

APs have roles in various tissues, and somfe have been associated with disease. Renin 

10 mediates the first step in processing the hormone angiotensin, which is responsible for regulating 
electrolyte balance and blood pressure (reviewed in Crews, D.E. and S.R. Williams (1999) Hum. 
Biol. 71:475-503). Abnormal regulation and expression of cathepsins are evident in various 
inflammatory disease states. Expression of cathepsin D is elevated in synovial tissues from patients 
with rheumatoid arthritis and osteoarthritis. The mcreased expression and differential regulation of 

15 the cathepsins are linked to the metastatic potential of a variety of cancers (Chambers, A.F. et al. 
(1993) Crit. Rev. Oncol. 4:95-114). 
Metalloproteases 

Metalloproteases require a metal ion for activity, usually manganese or zinc. Most zinc- 
dependent metalloproteases share a common sequence in the zinc-binding domain. The active site is 
20 made up of two histidines which act as zmc ligands and a catalytic glutamic acid C-terminal to the 
first histidine. Proteins containing this signature sequence are known as the metzincins and include 
aminopeptidase N, angiotensin-converting enzyme, neurolysin, the matrix metalloproteases and the 
adamalysins (ADAMS). An alternate sequence is found in the zinc carboxypeptidases, in which all 
three conserved residues - two histidines and a glutamic acid - are involved in zinc binding. 
25 A number of the neutral metalloendopeptidases, including angiotensin converting enzyme and 

. the aminopeptidases, are involved in the metabolism of peptide hormones. High aminopeptidase B 
activity, for example, is found in the adrenal glands and neurohypophyses of hypertensive rats (Prieto, 
1. Et al. (1998) Horm. Metab. Res. 30:246-248). Oligopeptidase M/neurolysin can hydrolyze 
bradykinin as well as neurotensin (Serizawa, A. et al. (1995) J. Biol. Chem 270:2092-2098). 
30 Neurotensin is a vasoactive peptide that can act as a neurotransmitter in the brain, where it has been 
implicated m limiting food intake (Tritos, N.A. et al. (1999) Neuropeptides 33:339-349). 

The matrix metalloproteases (MMPs) are a family of at least 23 enzymes that can degrade 
• components of the extracellular matrix (ECM). They are Zn""^ endopeptidases with an N-terminal 
catalytic domain. Nearly all members of the family have a hinge peptide and C-terminal domain 
35 which can bind to substrate molecules in the ECM or to inhibitors produced by the tissue (TIMPs, for 
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tissue inhibitor of metalloprotease; Campbell, I.L. et al. (1999) Trends Neurosci. 22:285). The 
presence of fibronectin-like repeats, transmembrane domains, or C-terminal hemopexinase-like 
domains can be used to separate MMPs into collagenase, gelatinase, stromelysin and membrane-type 
MMP subfamilies, hi the inactive form, the Zn"^^ ion in the active site interacts with a cysteine in the 
5 pro-sequence. Activating factors disrupt the Zn'*^^-cysteine interaction, or "cysteine switch," exposing 
the active site. This partially activates the enzyme, which then cleaves off its propeptide and becomes 
fiiUy active. MMPs are often activated by the serine proteases plasmin and furin. MMPs are often 
regulated by stoichiometric, noncovalent interactions with inhibitors; the balance of protease to 
inhibitor, then, is very important in tissue homeostasis (reviewed in Yong, V.W. et al. (1998) Trends 

10 Neurosci. 21:75). 

MMPs are implicated in a number of diseases including osteoarthritis (Mitchell, P. et al. 
. (1996) J. Clin. Inv. 97:761), atherosclerotic plaque rupture (Sukhova, G.K. et al. (1999) Circulation 
99:2503), aortic aneurysm (Schneiderman, J. et al. (1998) Am. J. Path. 152:703), non-healing wounds 
(Saarialho-Kere, U.K. et al. (1994) J. Clin. Inv. 94:79), bone resorption (Blavier, L. and J.M. Delaisse 

15 (1995) J. Cell Sci. 108:3649), age-related macular degeneration (Steen, B. et al. (1998) Invest. 

Ophthalmol. Vis. Sci. 39:2194), enq>hysema (Finlay, G.A. et al. (1997) Thorax 52:502), myocardial 
mfarction (Rohde, L.E. et al. (1999) Circulation 99:3063) and dilated cardiomyopathy (Thomas, C.V. 
et al. (1998) Circulation 97:1708). MMP inhibitors prevent metastasis of mammary carcinoma and 
experimental tumors in rat, and Lewis lung carcinoma, henoangioma, and human ovarian carcinoma 

20 xenografts in mice (Eccles S.A. et al. (1996) Cancer Res. 56:28 15; Anderson et al. (1996) Cancer 
Res. 56:715-718; Volpert, O.V. et al. (1996) J. Clin. Invest. 98:671; Taraboletti, G. et al. (1995) JNCI 
87:293; Davies, B. et al. (1993) Cancer Res. 53:2087). MMPs may be active in Alzheimer's disease. 
A number of MMPs are inq)licated in multiple sclerosis, and administration of MMP inhibitors can 
relieve some of its synqjtoms (reviewed in Yong, supra) . 

25 Another family of metalloproteases is the ADAMs, for A Disintegrin and Metalloprotease 

Domain, which they share with their close relatives the adamalysins, snake venom metalloproteases 
(SVMPs). ADAMs combine features of both cell surface adhesion molecules and proteases, 
containing a prodomain, a protease domain, a disintegrin domain, a cysteine rich domain, an . 
epidermal growth factor repeat, a transmembrane domain, and a cytoplasmic tail. The first three 

30 domains listed above are also found in the SVMPs. The ADAMs possess four potential functions: 
proteolysis, adhesion, signalmg and fusion. The ADAMs share the metzincin zinc binding sequence 
and are inhibited by some MMP antagonists such as TIMP-l. 

ADAMs are implicated in such processes as sperm-egg binding and fusion, myoblast ftision, 
and protein-ectodomain processing or shedding of cytokines, cytokine receptors, adhesion proteins 

35 and other extracellular protein domains (Schlondorff, J. and CP. Blobel (1999) J. Cell. Sci. 
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112:3603-3617). The Kuzbanian protein cleaves a substrate in the NOTCH pathway (possibly 
NOTCH itself), activating the program for lateral inhibition in Drosophila neural development. Two 
ADAMs, TACE (ADAM 17) and ADAM 10, are proposed to have analogous roles in the processing 
of amyloid precursor protein in the brain (Schlondorff and Blobel, supra) . TACE has also been 
5 identified as the TNF activating enzyme (Black, R,A. et al. (1997) Nature 385:729). TNF is a 

pleiotropic cytokine that is important in mobilizing host defenses in response to infection or trauma, 
but can cause severe damage in excess and is often overproduced in autoimmune disease. TACE 
cleaves membrane-bound pro-TNF to release a soluble form. Other ADAMs may be involved in a 
similar type of processing of other membrane-bound molecules. 

10 The ADAMTS sub-family has all of the features of ADAM family metalloproteases and 

contain an additional thrombospondin domain (TS). The prototypic ADAMTS was identified in 
mouse, found to be expressed in heart and kidney and upregulated by proinflammatory stimuli (Kuno, 
K. et al. (1997) J. Biol. Chem. 272:556), To date eleven members are recognized by the Human 
Genome Organization (HUGO; http://www.gene.ucl.ac.Uk/users/hester/adamts.htnil#Approved). 

15 Members of this family have the ability to degrade aggrecan, a high molecular weight proteoglycan 
which provides cartilage with nnportant mechanical properties including compressibility, and which 
is lost during the development of arthritis. Enzymes which degrade aggrecan are thus considered 
attractive targets to prevent and slow the degradation of articular cartilage (See, e.g., Tortorella, M.D. 
(1999) Science 284:1664; Abbaszade, I. (1999) J. Biol. Chem. 274:23443). Other members are 

20 reported to have antiangiogenic potential (Kuno et al., sugra) and/or procollagen processing (Colige, 
A. et al. (1997) ProcNatl. Acad. Sci. USA 94:2374). 

All members of the MDC family of integral membrane proteins contain a 
metalloproteinase-like domain, a disintegrin-like domain and a cysteine-rich domain. They have been 
identified in a wide range of mammalian tissues and many are abundantly expressed in the male 

25 reproductive tract. A number of MDC proteins (fertilin alpha, fertilin beta, tMDC I, tMDC H and 
tMDC ni) are localized to spermatogenic cells and processed as spermatozoa pass through the 
epididymis, yielding proteins that retain then- disintegrin domain on mature spermatozoa. Fertilin beta 
and tMDC I have been implicated in egg recognition, mediated by a disintegrin-integrin interaction 
(Frayne, J. et al. (1998) J. Reprod. Fertil. Suppl. 53:149-155). 

30 Examples of manganese metalloenzymes include aminopeptidase P and human proline 

dipeptidase (FEPD). Aminopeptidase P can degrade bradykinin, a nonapeptide activated in a variety 
of inflammatory responses. Aminopeptidase P has been implicated in coronary ischemia/reperfusion 
injury. Administration of aminopeptidase P inhibitors has been shown to have a cardioprotective 
effect in rats (Ersahin, C. et al (1999) J. Cardiovasc. Pharmacol. 34:604-611). 

35 Protease inhibitors 

10 
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Protease inhibitors and other regulators of protease activity control the activity and effects of 
proteases. Protease inhibitors have been shown to control pathogenesis in animal models of 
proteolytic disorders (Murphy, G. (1991) Agents Actions Suppl. 35:69-76). In patients with HIV 
disease protease inhibitors have been shown to be effective in preventing disease progression and 
5 reducing mortality (Barry, M. et al. (1997) Clin. Pharmacokinet. 32: 194-209). Low levels of the 
cystatins, low molecular weight inhibitors of the cysteine proteases, correlate with malignant 
progression of tumors. (Calkins, C. et al. (1995) Bioh Biochem. Hoppe Seyler 376:71-80). The 
cystatin superf amily of protease inhibitors is characterized by a particular pattern of linearly arranged 
and tandemly repeated disulfide loops (Kellermann, J. et al. (1989) J. Biol. Chem. 264:14121-14128). 

10 An example of a representative of a structural prototype of a novel family among the cystatin 

superfamily is human alpha 2-HS glycoprotein (AHSG), a plasma protein synthesized in liver and 
selectively concentrated in bone matrix, dentine, and other mineralized tissues (Triffitt, J.T. (1976) 
Calcif. Tissue Res. 22:27-33), which is also classified as belonging to the fetuin family. Fetuins are 
characterized by the presence of 2 N-terminally located cystatin-like repeats and a unique C-terminal 

15 domain which is not present in other proteins of the cystatin superfamily (PROSITE PDOC00966). 
AHSG has been reported to be involved in bone formation and resorption as well as immune 
responses (Yang, F.etal. (1992) 1130:149-156; Lee, CCetal. (1987) PNAS USA 84:4403-4407; 
Nakamura, O. et al. (1999) Biosci. BiotechnoL Biochem. 63:1383-1391). AdditionaUy, AHSG has 
been implicated in infertility associated with endometriosis (Mathur, S.P. (2000) Am. J. Reprod. 

20 Immunol. 44:89-95; Mathur, S.P. et al. (1999) Autoimmunity 29: 121-127) and inhibition of 

osteogenesis (Binkert, C. et al, (1999) J. Biol Chem. 274:28514-28520). Decreased serum levels of 
AHSG have been detected in patients with acute leukemias, chronic granulocyte and myelomonocyte 
leukemias, lymphomas, myelofibrosis, multiple myeloma, metastatizing solid tumors, systemic lupus 
erythematosus, rheunaatoid arthritis, acute alcoholic hepatitis, fatty liver, chronic active hepatitis, 

25 liver cirrhosis, acute and chronic pancreatitis, and Crohn's disease (Kalabay, L. et al, (1992) Qrv. 
Hetil. 133:1553-1554; 1559-1560). Serpins are inhibitors of mammalian plasma serine proteases^ 
Many serpins serve to regulate the blood clotting cascade and/or the complement cascade in 
mammals. Sp32 is a positive regulator of the mammalian acrosomal protease, acrosin, that binds the 
proenzyme, proacrosin, and thereby aides in packaging the enzyme into the acrosomal matrix (T. 

30 Baba et al. (1994) J. Biol. Chem. 269:10133-10140). The Kunitz family of serine protease inhibitors 
are characterized by one or more "Kunitz domains" containing a series of cysteine residues that are 
regularly spaced over approximately 50 amino acid residues and form three intrachain disulfide 
bonds. Members of this family include aprotinin, tissue factor pathway inhibitor (TFPI-1 and TFPI- 
2), inter-a-trypsui inhibitor (ITI), and bikunin. (Marlor, CW. et al. (1997) J. Biol. Chem. 272:12202- 

35 12208.) Members of this family are potent inhibitors (in the nanomolar range) against serine 
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proteases such as kallikrein and plasmin. has clinical utility in reduction of perioperative blood loss, 
rri has been found to inactivate hunotan trypsin, chymotrypsin, neutrophil elastase and cathepsin G 
(Morii, M. et al (1985) Biol. Chem. Hoppe Seyler 366:19-21); and is suspected of playing a key role 
in the biology of the extracellular matrix and in the pathophysiology of chronic bronchopulmonary 
5 diseases or lung cancer progression (Cuvelier, A. et al. (2000) Rev. Mai. Respir. 17:437-446). 

Eppin (Epididymal protease mhibitor) is a family of protease inhibitors expressed in the 
epididymis and testis. Tv^o eppm isoforms contain both Kunitz-type and WAP-type four disulfide 
core protease inhibitor consensus sequences. Eppin-1 is expressed only in the testis and epididymis; 
Eppin-2 is expressed only in the epididymis and Eppin-3 only in the testis (Richardson, R.T. et al. 

10 (2001) Gene 270:93-102). 

Hunaan cystatin C is a potent inihibitor of cysteine proteases. Further, it has amyloidogenic 
properties. It refolds to produce very tight two-fold symmetric dimers while retaining the secondary 
structure of the monomeric form. The structvire suggests a mechanism for its aggregation in the brain 
arteries of elderly people with amyloid angiopathy. A more severe 'conformational disease' is 

15 associated with the L68Q mutant of hun^n cystatin C, which causes massive amyloidosis, cerebral 
hemorrhage, and death in young adults (Janowski, R. et al. (2001) Nat. Struct. Biol. 8(4):316-20). 

A major portion of all proteins synthesized in eukaryotic cells are synthesized on the 
cytosolic surface of the endoplasmic reticuliun (ER). Before these immature proteins are distributed 
to other organelles in the cell or are secreted, they must be transported into the interior lumen of the 

20 ER where post-translational modifications are performed. These modifications include protein 
folding and the formation of disulfide bonds, and N-linked glycosylations. 
Protein Isomerases 

Protem folding in the ER is aided by two principal types of protein isomerases, protein 
disulfide isomerase (PDI), and peptidyl-prolyl isomerase (PPI). PDI catalyzes the oxidation of free 

25 sulfhydryl groups in cysteine residues to form intramolecular disulfide bonds in proteins. PPI, an 
enzyme that catalyzes the isomerization of certain proline imidic bonds in oligopeptides and proteins, 
is considered to govern one of the rate limiting steps in the folding of many proteins to their final 
functional conformation. The cyclophilins represent a major class of PPI that was originally 
identified as the major receptor for the immunosuppressive drug cyclosporin A (Handschumacher, 

30 R.E, et al. (1984) Science 226: 544-547). 
Protein Glvcosvlation 

The glycosylation of most soluble secreted and membrane-bound proteins by 
oligosaccharides linked to asparagine residues in proteins is also performed in the ER. This reaction 
is catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. Although the exact purpose 

35 of this "N-linked" glycosylation is imknown, the presence of oligosaccharides tends to make a 
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glycoprotein resistant to protease digestion. In addition, oligosaccharides attached to cell-surface 
proteins called selectins are known to function in cell-cell adhesion processes (Alberts, B. et aL 
(1994) Molecular Biology of the Cell Garland Publishing Co., New York, NY. p.608). "O-linked" 
glycosylation of proteins also occurs in the ER by the addition of N-acetylgalactosamine to the 
5 hydroxy! group of a serine or threonine residue followed by the sequential addition of other sugar 
residues to the first. This process is catalyzed by a series of glycosyltransferases each specific for a 
particular donor sugar nucleotide and acceptor molecule (Lodish, H. et aL (1995) Molecular Cell 
Biology, W. H. Freeman and Co., New York, NY pp.700-708). In many cases, both - and 0-linked 
oligosaccharides appear to be required for the secretion of proteins or the movement of plasma 

10 membrane glycoproteins to the cell surface. For example, one of the glycosyltransferases in the 
dolichol pathway, dolichol phosphate mannose synthase,, is required in N: -glycosylation, 
O-mannosylation, and glycosylphosphatidylinositol membrane anchoring of protem (Tomita, S. et al. 
(1998) J. Biol. Chem. 9249-9254). Thus, m many cases, both N- and 0-linked oligosaccharides 
appear to be required for the secretion of proteins or the movement of plasma membrane 

15 . glycoproteins to the cell surface. 

An additional glycosylation mechanism operates in the ER specifically to target lysosomal 
enzymes to lysosomes and prevent thek secretion. Lysosomal enzymes in the ER receive an N- 
linked oligosaccharide, like plasma membrane and secreted proteins, but are then phosphorylated on 
one or two mannose residues. The phosphorylation of mannose residues occurs in two steps, the first 

20 step being the addition of an N-acetylglucosamine phosphate residue by N-acetylglucosamine 
phosphotransferase, and the second the removal of the N-acetylglucosamine group by 
phosphodiesterase. The phosphorylated mannose residue then targets the lysosomal enzyme to a 
mannose 6-phosphate receptor which transports it to a lysosome vesicle (Lodish et al. supra , pp. 708- 
711). 

25 

Chaperones 

Molecular chaperones are proteins that aid in the proper folding of immature proteins and 
refolding of improperly folded ones, the assembly of protein subunits, and in the transport of 

30 unfolded proteins across membranes. Chaperones are also called heat-shock proteins (hsp) because 
of their tendency to be expressed in dramatically increased amounts following brief exposure of cells 
to elevated temperatures. This latter property most likely reflects their need in the refolding of 
proteins that have become denatured by the high temperatures. Chaperones may be divided into 
several classes according to their location, function, and molecular weight, and include hsp60, TCPl, 

35 hsp70, hsp40 (also called DnaJ), and hsp90. For example, hsp90 binds to steroid hormone receptors. 
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represses transcription in the absence of the ligand, and provides proper folding of the ligand-binding 
domain of the receptor in the presence of the honnone (Burston, S.G. and A.R. Clarke (1995) Essays 
Biochem. 29:125-136). Hsp60 and hsp70 chaperones aid in the transport and folding of newly 
synthesized proteins. Hsp70 acts early in protein folding, binding a newly synthesized protein before 
5 it leaves the ribosome and transporting the protein to the mitochondria or ER before releasing the 
folded protein. Hsp60, along with hsplO, binds misfolded proteins and gives them the opportunity to 
refold correctly. All chaperones share an affinity for hydrophobic patches on incompletely folded 
proteins and the ability to hydrolyze ATP. The energy of ATP hydrolysis is used to release the hsp- 
bound protein in its properly folded state (Alberts, B. et al. supra , pp 214, 571-572). 

10 Dipeptidyl-peptidase I, a lysosomal cysteine proteinase, is important in intracellular 

degradation of proteins and appears to be a central coordinator for activation of many serine 
proteinases in inounune/inflammatory cells. The gene has been mapped to chromosomal region 
1 Iql4.1-ql4.3. Dipeptidyl-peptidase I is expressed at high levels in lung, kidney, and placenta, and 
also at higli levels in polymorphonuclear leukocytes and alveolar macrophages and their precursor 

15 cells (Rao, N.V. et al. (1997) J. Biol. Chem.272: 10260-10265). 

lAP is a protein family that has baculovirus lAP repeat (BIR) domains and inhibits apoptosis. 
A human lAP family gene, ApoUon, encodes a 530 kDa protein that contains a single BIR domain and 
a ubiquitin-conjugating enzyme domain. ApoUon has been observed to protect cells from undergoing 
apoptosis and implicated in tumorigenesis and drug resistance ((Then, Z. et al. (1999) Biochem. 

20 Biophys. Res. Commun. 264:847-854). 

The RTVL-H family is a medium repetitive family of endogenous retrovirus-like sequences 
found in the genomes of humans and other primates. Different subfamilies of RTVL-H elements are 
designated Type I, Type la, and Type n (Goodchild, N.L. (1993) Virology 196:778-788). 
Lvsvl Hydroxylases 

25 Lysyl hydroxylase is an enzyme involved in collagen biosynthesis. Collagens are a family of 

fibrous structural proteins that are found in essentially all tissues. Collagens are the most abundant 
proteins in mammals, and are essential for the formation of connective tissue such as skin, bone, 
tendon, cartilage, blood vessels and teeth. Members of the collagen family can be distinguished from 
one another by the degree of cross-linking between collagen fibers and by the number of carbohydrate 

30 units (e.g., galactose or glucosylgalactose) attached to the collagen fibers. Hydroxylated lysine 
residues (hydroxylysine) are essential for stability of cross-linking and as attachment points for 
carbohydrate units. 

The enzyme lysyl hydroxylase catalyzes the hydroxylation of lysine residues to form 
hydroxylysine. Lysyl hydroxylase targets the lysine residue of the sequence, X-lys-gly (lys = lysine, 
35 gly = glycine, and X = any amino acid residue). Three isoforms of lysyl hydroxylase have been 
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characterized, termed LHl (or PLOD; procollagen-lysine, 2-oxogIutarate 5-dioxygenase), LH2 (or 
PL0D2), and LH3. The three enzymes share 60% sequence identity overall, with even higher 
similarity in the C-terminal region. In addition, there are regions in the middle of the molecule that 
have an identity of more than 80% (Valtavaara, M. et al (1998) J. Biol. Chem. 273:12881-12886). 
5 Diminished lysyl hydroxylase activity is involved in certain connective tissue disorders, hi 

particular mutations, including a truncation and duplications within the coding region of the gene for 
PLOD, have been described in patients with type VI Ehlers-Danos syndrome (Hyland, J. et al. (1992) 
Nature Genet. 2:228-31; Hautala, T. et al. (1993) Genomics 15:399-404). 
Ubiquitin-Associated Proteins 

10 The ubiquitin conjugation system (UCS), is a major pathway for the degradation of cellular 

proteins of all types, including protems that function to activate or repress cellular processes such as 
transcription, cell cycle progression, and unmune recognition (Ciechanover, A. (1994) Cell 79:13-21). 
The process of ubiquitin conjugation and protein degradation involves several steps (Jentsch, S. 
(1992) Annu. Rev. Genet. 26: 179-207). First ubiquitin (Ub), a small, heat stable protein is activated 

15 by a ubiquitin-activating enzyme (El) in an ATP dependent reaction which binds the C-terminus of 
Ub to the thiol group of an internal cysteine residue in EL Activated Ub is then transferred to one of 
several Ub-conjugating enzymes (E2). Different ubiquitin-dependent proteolytic pathways employ 
structurally sunilar, but distinct ubiquitin-conjugating enzymes that are associated with recognition 
subunits which direct them to proteins carryuig a particular degradation signal. E2 then transfers the 

20 Ub molecule through its C-ternmial glycine to a member of the ubiquitin-protein ligase family, E3. 

Next, E3 transfers the Ub molecule to the target protein. Additional Ub molecules may be added to ^ 
the target protein forming a multi-Ub chain structure. The ubiquitinated protein is then recognized 
and degraded by the proteasome, an intracellular protease complex found in some bacteria and in all 
eukaryotic cells. The resultant ubiquitin-peptide complex is hydrolyzed by a ubiquitin carboxyl 

25 terminal hydrolase, and free ubiquitin is released for reutilization by the UCS. 

Ubiquitin-proteasome systems are implicated in the degradation of mitotic cyclic kinases, 
oncoproteins, tumor suppressor genes (p53), cell surface receptors associated with signal 
transduction, transcriptional regulators, and mutated or damaged proteins (Ciechanover, supra) . This 
pathway has been implicated in a number of diseases, including cystic fibrosis, Angelman's 

30 syndrome, and Liddle syndrome (reviewed in Schwartz, A.L. and A. Ciechanover (1999) Annu. Rev. 
Med. 50:57-74). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin protease whose 
overexpression leads to oncogenic transformation of NIH3T3 cells. The human homologue of this 
gene is consistently elevated in small cell tumors and adenocarcinomas of the lung (Gray, D.A. 
(1995) Oncogene 10:2179-2183). Ubiquitin carboxyl terminal hydrolase is involved in the 

35 differentiation of a lymphoblastic leukemia cell line to a non-dividing mature state (Maki, A. et al. 

15 
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(1996) Differentiation 60:59-66). In neurons, ubiquitin carboxyl terminal hydrolase (PGP 9.5) 
expression is strong in the abnormal structures that occur in human neurodegenerative diseases 
. (Lowe, J. et al. (1990) J. PathoL 161:153-160). 

Additional ubiqiiitin-like proteins which also possess the ability to covalently modify other 
5 cellular proteins have been identified in recent years. (For review, see Yeh, E.T.H. et al. (2000) Gene 
248:144; and Jentsch, S. and Pyrowolalds, G. (2000) Trends Cell Biol. 10:335-342.) These 
ubiquitin-like protein modifiers include the sentrins (also known as SUMO proteins), NEDD8, and 
Apgl2. The conjugation pathways for these proteins closely resemble that for ubiquitin. For 
example, conjugation of sentrin requires the El heterodimer AOS1/UBA2, and a single E2 enzyme, 

10 UBC9. The recently .discovered protein S3 may function as a sentrin ligase. The yeast protein Ulpl 
is a sentrin hydrolase. Inactivation of Ulpl in yeast results in severe cell cycle defects. In humans, 
seven sentrin specific proteases (SENP) have been identified, which range in size from 238 to 11 12 
amino acid residues (Yeh, supra ). All human SENPs share a conserved C-terminal domain. The N- 
terminal regions may regulate cellular location and substrate specificity. 

15 Sentrinization does not promote protein degradation as does ubiquitin. In some cases 

sentrinization appears to be important for stable localization of target proteins in nuclear bodies. 
Substrates for sentrinization include PML, a RING finger protein with tumor suppressor activity, 
HIPK2, a co-repressor for homeodomain transcription factors, and the tumor suppressor p53. iKBa, a 
cytosolic inhibitor of NFkB, a transcription factor involved in induction of inflammation associated 

20 proteins, is also a substrate for sentrinization. Sentrinized IxBa cannot be ubiquitinated and is 
resistant to proteasomal degradation, suggesting links between the ubiquitin and sentrin pathways. 
Jentsch, supra) . 

Expression profiling 

25 Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of 

molecules spatially distributed over, and stably associated with, the surface of a soHd support. 
Microarrays of polypeptides, polynucleotides, and/or antibodies have been developed and find use in 
a variety of applications, such as gene sequencing, monitoring gene expression, gene ntiapping, 
bacterial identification, drug discovery, and combinatorial chemistry. 

30 One area in particular in which microarrays find use is in gene expression analysis. Array 

technology can provide a simple way to explore the expression of a single,polymorphic gene or the 
expression profile of a large number of related or unrelated genes. When the expression of a single 
gene is examined, arrays are employed to detect the expression of a specific gene or its variants. 
When an expression profile is examined, arrays provide a platform for identifying genes that are 

35 tissue specific, are affected by a substance being tested in a toxicology assay, are part of a signalmg 
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cascade, carry out housekeeping functions, or are specifically related to a particular genetic 
predisposition, condition, disease, or disorder. 
Steroids affecting protein modification 

Steroids are a class of lipid-soluble molecules, including cholesterol, bile acids, vitamin D, 
5 and hormones, that share a common four-ring structure based on cyclopentanoperhydrophenanthrene 
and that carrry out a wide variety of functions. Cholesterol, for example, is a component of cell 
membranes that controls membrane fluidity. It is also a precursor for bile acids which solubilize 
lipids and facilitate absorption in the small intestine during digestion. Vitamin D regulates the 
absorption of calcium in the smiall intestine and controls the concentration of calcium in plasma. 

10 Steroid hormones, produced by the adrenal cortex, ovaries, and testes, include glucocorticoids, 

mineralocorticoids, androgens, and estrogens. They control various biological processes by binding 
to intracellular receptors that regulate transcription of specific genes in the nucleus. Glucocorticoids, 
for example, increase blood glucose concentrations by regulation of gluconeogenesis in the liver, 
increase blood concentrations of fatty acids by promoting lipolysis in adipose tissues, modulate 

15 sensitivity to catcholamines ha the central nervous system, and reduce inflammation. The principal 
mineralocorticoid, aldosterone, is produced by the adrenal cortex and acts on cells of the distal 
tubules of the kidney to enhance sodium ion reabsorption. Androgens, produced by the interstitial 
cells of Leydig in the testis, include the male sex hormone testosterone, which triggers changes at 
puberty, the production of sperm and maintenance of secondary sexual characteristics. Female sex 

20 hormones, estrogen and progesterone, are produced by the ovaries and also by the placenta and 
adrenal cortex of the fetus during pregnancy. Estrogen regulates female reproductive processes and 
secondary sexual characteristics. Progesterone regulates changes in the endometrium during the 
menstrual cycle and pregnancy. 

Steroid hormones are widely used for fertility control and in anti-inflammatory treatments for 

25 physical injuries and diseases such as arthritis, asthma, and auto-immune disorders. Progesterone, a 
naturally occurring progestin, is primarily used to treat amenorrhea, abnormal uterine bleeding, or as 
a contraceptive. Endogenous progesterone is responsible for inducing secretory activity in the 
endometrium of the estrogen-primed uterus in preparation for the implantation of a fertilized egg and 
for the maintenance of pregnancy. It is secreted from the corpus luteum in response to luteinizing 

30 hormone (LH). The primary contraceptive effect of exogenous progestins involves the suppression 
of the midcycle surge of LH. At the cellular level, progestins diffuse freely into target cells and bind 
to the progesterone receptor. Target cells include the female reproductive tract, the mammary gland, 
the hypothalamus, and the pituitary. Once bound to the receptor, progestins slow the frequency of 
release of gonadotropin releasing hormone from the hypothalamus and blunt the pre-ovulatory LH 

35 surge, thereby preventing follicular maturation and ovulation. Progesterone has minimal estrogenic 
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and androgenic activity. Progesterone is metabolized hepatically to pregnanediol and conjugated with 
glucuronic acid. 

Medroxyprogesterone (MAH), also known as 6a-methyl-17-hydroxyprogesterone, is a 
synthetic progestin with a pharmacological activity about 15 times greater than progesterone. MAH 
5 is used for the treatment of renal and endometrial carcinomas, amenorrhea, abnormal uterine 

bleeding, and endometriosis associated with hormonal imbalance. MAH has a stimulatory effect on 
respiratory centers and has been used in cases of low blood oxygenation caused by sleep apnea, 
chronic obstructive pulmonary disease, or hypercapnia. 

Mifepristone, also known as RU-486, is an antiprogesterone drug that blocks receptors of 

10 progesterone. It counteracts the effects of progesterone, which is needed to sustain pregnancy. 
Mifepristone induces spontaneous abortion when administered in early pregnancy followed by 
treatment with the prostaglandin, misoprostol. Further, studies show that mifepristone at a 
substantially lower dose can be highly effective as a postcoital contraceptive when administered 
within five days after unprotected intercourse, thus providing women with a "morning-after pill" in 

15 case of contraceptive failure or sexual assault. Mifepristone also has potential uses in the treatment 
of breast and ovarian cancers in cases in which tumors are progesterone-dependent. It interferes with 
steroid-dependent growth of brain meningiomas, and may be useful in treatment of endometriosis- 
where it blocks the estrogen-dependent growth of endometrial tissues. It may also be useful in 
treatment of uterine fibroid tumors and Gushing' s Syndrome. Mifepristone binds to glucocorticoid 

20 receptors and interferes with Cortisol binding. Mifepristone also may act as an anti-glucocorticoid 
and be effective for treating conditions where Cortisol levels are elevated such as AIDS, anorexia 
nervosa, ulcers, diabetes, Parkinson's disease, multiple sclerosis, and Alzheimer's disease. 

Danazol is a synthetic steroid derived from ethinyl testosterone, Danazol indirectly reduces 
estrogen production by lowering pituitary synthesis of follicle-stimulating hormone and LH. Danazol 

25 also binds to sex hormone receptors in target tissues, thereby exhibiting anabolic, antiestrognic, and 
. weakly androgenic activity. Danazol does not possess any progestogenic activity, and does not 
suppress normal pituitary release of corticotropin or release of Cortisol by the adrenal glands. 
Danazol is used in the treatment of endometriosis to relieve pain and inhibit endometrial cell growth. 
It is also used to treat fibrocystic breast disease and hereditary angioedema. 

30 Corticosteroids are used to relieve inflammation and to suppress the immune response. They 

inhibit eosinophil, basophil, and airway epithelial cell function by regulation of cytokines that 
mediate the inflammatory response. They inhibit leukocyte infiltration at the site of inflammation, 
interfere in the function of mediators of the inflammatory response, and suppress the humoral 
immune response. Corticosteroids are used to treat allergies, asthma, arthritis, and skin conditions. 

35 Beclomethasone is a synthetic glucocorticoid that is used to treat steroid-dependent asthma, to relieve 
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symptoms associated with allergic or nonallergic (vasomotor) rhinitis, or to prevent recurrent nasal 
polyps following surgical removal. The anti-inflammatory and vasoconstrictive effects of intranasal 
beclomethasone are 5000 times greater than those produced by hydrocortisone. Budesonide is a 
corticosteroid used to control symptoms associated with allergic rhinitis or asthma; Budesonide has 
5 high topical anti-inflammatory activity but low systemic activity. Dexamethasone is a synthetic 
glucocorticoid used in anti-inflammatory or immunosuppressive compositions. It is also used in 
inhalants to prevent symptoms of asthma. Due to its greater ability to reach the central nervous 
system, dexamethasone is usually the treatment of choice to control cerebral edema. Dexamethasone 
is approximately 20-30 times more potent than hydrocortisone and 5-7 times more potent than 

10 prednisone. Prednisone is metabolized in the liver to its active form, prednisolone, a glucocorticoid 
with anti-inflammatory properties. Prednisone is approximately 4 times more potent than 
hydrocortisone and the duration of action of prednisone is intermediate between hydrocortisone and 
dexamethasone. Prednisone is used to treat allograft rejection, asthma, systemic lupus erythematosus, 
arthritis, ulcerative colitis, and other inflammatory conditions. Betamethasone is a synthetic 

15 glucocorticoid with antiinflammatory and immunosuppressive activity and is used to treat psoriasis 
and fungal infections, such as athlete's foot and ringworm. 

The anti-inflammatory actions of corticosteroids are thought to involve phospholipase A3 
inhibitory proteins, collectively called lipocortins. Lipocortins, in turn, control the biosynthesis of 
potent mediators of inflanomation such as prostaglandins and leukotrienes by inhibiting the release of 

20 the precursor molecule arachidonic acid. Proposed mechanisms of action include decreased IgE 
synthesis, increased number of P-adrenergic receptors on leukocytes, and decreased arachidonic acid 
metabolism. During an inrmiediate allergic reaction, such as in chronic bronchial asthma, allergens 
bridge the IgE antibodies on the surface of mast cells, which triggers these cells to release 
chemotactic substances. Mast cell influx and activation, therefore, is partially responsible for the 

25 inflammation and hyperiixitability of the oral mucosa in asthmatic patients. This inflammation can be 
retarded by administration of corticosteroids. 
Toxicology Testing: 

Array technology can provide a simple way to explore the expression of a single polymorphic 
gene or the expression profile of a large number of related or umelated genes. When the expression 
30 of a single gene is examiued, arrays are employed to detect the expression of a specific gene or its 
variants. When an expression profile is examined, arrays provide a platform for examining which 
genes are tissue specific, carry out housekeeping functions, are parts of a signaling cascade, or are 
specifically related to a particular genetic predisposition, condition, disease, or disorder. 

The potential application of gene expression profiling is particularly relevant to improving 
35 diagnosis, prognosis, and treatment of disease. For example, both the levels and sequences expressed 
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in tissues from subjects with hyperlipidemia may be compared with the levels and sequences 
expressed in normal tissue. 

Toxicity testing is a mandatory and time-consuming part of drug development programs in 
the pharmaceutical industry. A more rapid screen to determine the effects upon metabolism and to 
5 detect toxicity of lead drug candidates may be the use of gene expression microarrays. For example, 
microarrays of various kinds may be produced using full length genes or gene fragments. These 
arrays can then be used to test samples treated with the drug candidates to elucidate the gene 
expression pattern associated with drug treatment. This gene pattern can be compared with gene 
expression patterns associated with compounds which produce known metabolic and toxicological 
10 responses. 

The human C3 A cell line is a clonal derivative of HepG2/C3 (hepatoma cell line, isolated 
from a 15-year-old male with liver tumor), which was selected for strong contact inhibition of growth. 
The use of a clonal population enhances the reproducibility of the cells. C3A cells have many 
characteristics of primary human hepatocytes in culture: i) expression of insulin receptor and 

15 insulin-like growth factor II receptor; ii) secretion of a high ratio of serum albumin compared with a- 
j fetoprotein iii) convertion of ammonia to urea and glutamine; iv) metabolism of aromatic amino 
acids; and v) ability to proliferate in glucose-free and insulin-free medium. The C3A cell line is now 
well established as an in vitro model of the mature human liver (Mickelson et al. (1995) Hepatology 
22:866-875; Nagendra et al. (1997) Am. J. Physiol 272:0408-416). 

20 Clofibrate is an hypolidennic drag which lowers elevated levels of serum triglycerides. In 

rodents, chronic treatment produces hepatomegaly and an increase in hepatic peroxisomes 
(peroxisome proliferation). Peroxisome proliferators (PPs) are a class of drugs which activate the PP- 
activated receptor in rodent liver^ leading to enzyme induction, stimulation of S-phase, and a 
suppression of apoptosis (Hasmall and Roberts (1999) Pharmacol. Ther. 82:63-70). PPs mclude the 

25 fibrate class of hypolidemic drugs, phenobarbitone, thiazolidinediones, certain non-steroidal anti- 
inflammatory drugs, and naturally-occuring fatty acid-derived molecules (Gelman et aL (1999) Cell. 
Mol. Life Sci. 55:932-943). Clofibrate has been shown to increase levels of cytochrome P450 4A. It 
is also involved in transcription of ^-oxidation genes as well as induction of PP-activated receptors 
(Kawashinaa et al. (1997) Arch. Biochem. Biophys. 347:148-154). Peroxisome proliferation that is 

30 induced by both clofibrate and the chemically-related compound fenofibrate is mediated by a 
common inhibitory effect on mitochondrial membrane depolarization (Zhou and Wallace (1999) 
Toxicol. Sci. 48:82-89). 

Dexamethasone and its derivatives, dexamethasone sodium phosphate and dexamethasone 
acetate, are synthetic glucocorticoids used as anti-inflammatory or immunosuppressive agents. 

35 Dexamethasone has little to no mineralocorticoid activity and is usually selected for management of 
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cerebral edema because of its superior ability to penetrate the central nervous sytem. Glucocorticoids 
are naturally occurring hormones that prevent or suppress inflammation and immune responses when 
administered at pharmacological doses. Responses can include inhibition of leukocyte infiltration at 
the site of inflammation, interference in the function of mediators of inflammatory response, and 
suppression of humoral immune responses. The anti-inflammatory actions of corticosteroids are 
thought to involve phospholipase A2 inhibitory proteins, collectively called lipocortins. The 
numerous adverse effects related to corticosteroid use usually depend on the dose administered and 
the duration of therapy. Proposed mechanisnas of action include decreased IgE synthesis, increased 
number of p-adrenergic receptors on leukocytes, and decreased arachidonic acid metabolism. During 
an immediate allergic reaction, such as in chronic bronchial asthma, allergens bridge the IgE 
antibodies on the surface of mast cells, which triggers these cells to release chemotactic substances. 
Mast cell influx and activation, therefore, is partially responsible for the inflammation and 
hyperinitability of the oral mucosa in asthmatic patients. This inflammation can be retarded by 
administration of adrenocorticoids. As with other corticosteroids, the effects upon liver metabolism 
and hormone clearance mechanisms are important to understand the pharmacodynamics of a drug. 
Cancer 

Prostate cancer develops through a multistage progression ultintiately resulting in an 
aggressive tumor phenotype. The initial step in tumor progression involves the hyperproliferation of 
normal luminal and/or basal epithelial cells. Androgen responsive cells become hyperplastic and 
evolve into early-stage tumors. Although early-stage tumors are often androgen sensitive and respond 
to androgen ablation, a population of androgen independent cells evolve from the hyperplastic . 
population. These cells represent a more advanced form of prostate tumor that may become invasive 
and potentially become metastatic to the bone, brain, or lung. 

Breast cancer develops through a multi-step process in which pre-malignant mammary 
epithelial cells undergo a relatively defined sequence of events leading to tumor formation. An early 
event in tumor development is ductal hyperplasia. Cells undergoing rapid neoplastic growth gradually 
progress to invasive carcinoma and become metastatic to the lung, bone, and potentially other organs. 
Several variables that may influence the process of tumor progression and malignant transformation 
include genetic factors, environmental factors, growth factors, and hormones. Based on the 
complexity of this process, it is critical to study a population of human mammary epithelial cells 
undergoing the process of malignant transformation, and to associate specific stages of progression 
with phenotypic and molecular characteristics. 
Immune response proteins 

Interleukin 12 (IL-12).is a pleiotropic cytokine produced by macrophages and B lymphocytes 
that can have multiple effects on T cells and natural killer (NK) cells. Effects include mducing 
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production of BFN-y and TNF by resting and activated T and NK cells; enhancing the cytotoxic " 
activity of resting NK and T cells, inducing and synergizing with IL-2 in the generation of 
lymphokine-activated killer (LAK) cells; acting as a comitogen to stimulate proliferation of resting T 
cells; and inducing proliferation of activated T and NK cells. Current evidence indicates that IL-12, 

5 produced by macrophages in response to infectious agents, is a central mediator of the cell-mediated 
immune response by its actions on the development, proliferation, and activities of THl cells. As the 
initiator of cell-mediated immunity, IL-12 may stimulate cell-mediated imraime responses to 
microbial pathogens, metastatic cancers, and viral infections such as AIDS. 

There is a need in the art for new^ conq)ositions, including nucleic acids and proteins, for the 

10 diagnosis, prevention, and treatment of gastrointestinal, cardiovascular, autoimmune/inflanmiatory, 
cell proliferative, developmental, epithelial, neurological, and reproductive disorders. 

SUMMARY OF THE INVENTION 

Various embodinoents of the invention provide purified polypeptides, protein modification 

15 and maintenance molecules, referred to collectively as 'TMOD" and individually as "PMOD-1," 
"PMOD-2," "PMOD-3," 'TPMOD-4," "PMOD-5," "PMOD-6," 'TMOD-?," "PMOD-8," "PMOD-9," 
"PMOD-10," "PMOD-11," "PMOD-12," "PMOD-13," "PMOD-14," "PMOD-15," "PMOD-16," 
"PMOD-17," "PMOD-18," *TM0D-19," TMOD-20," "PMOD-21," 'TMOD-22," "PMOD-23," 
"PMOD-24," '^00-25," "PMOD-26," "PMOD-27," and "PMOD-28." and methods for usmg these 

20 proteins and their encoding polynucleotides for the detection, diagnosis, and treatment of diseases and 
medical conditions. Embodiments also provide methods for utilizing the purified protein 
modification and maintenance molecules and/or their encoding polynucleotides for facilitating the 
drug discovery process, including determination of efficacy, dosage, toxicity, and phannacology. 
Related embodiments provide methods for utilizing the purified protein modification and 

25 maintenance molecules and/or their encoding polynucleotides for investigating the pathogenesis of 
diseases and medical conditions. 

An embodiment provides an isolated polypeptide selected from the group consisting of a) a 
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
28, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at 

30 least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-28, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
firom the group consisting of SEQ ID NO: 1-28, and d) an inununogenic fragment of a polypeptide 
having an amino acid sequence selected fi:om the group consisting of SEQ ID NO: 1-28. Another 
embodiment provides an isolated polypeptide comprising an anaino acid sequence of SEQ ID NO: 1- 

35 28. 
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Still another embodiment provides an isolated polynucleotide encoding a polypeptide 
selected from the group consisting of a) a polypeptide comprising an amino acid, sequence selected 
from the group consisting of SEQ ID NO: 1-28, b) a polypeptide comprising a naturally occurring 
amino acid sequence at least 90% identical or at least about 90% identical to an amino acid sequence 
5 selected from the group consisting of SEQ ID NO: 1-28, c) a biologically active fragment of a 

polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, 
and d) an inununogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-28. In another embodmient, the polynucleotide encodes a 
polypeptide selected from the group consisting of SEQ ID NO: 1-28. In an alternative embodmient, 

10 the polynucleotide is selected from the group consistmg of SEQ ID NO:29-56. 

Still another embodiment provides a recombinant polynucleotide comprising a promoter 
sequence operably Imked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-28, b) a polypeptide comprising a naturally occurring amino acid sequence at least 

15 90% identical or at least about 90% identical to an ariiino acid sequence selected from the group 
consisting of SEQ ID NO: 1-28, c) a biologically active fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-28, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28. Another embodiment provides a cell transformed with the recombinant polynucleotide. 

20 Yet another embodiment provides a transgenic organism comprising the recombinant polynucleotide. 

Another embodiment provides a method for producing a polypeptide selected from the group ' ' 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-28, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from 'the group 

25 consisting of SEQ ID NO: 1-28, c) a biologically active fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-28, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28, The method comprises a) culturing a cell under conditions suitable for expression of the 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 

30 promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

Yet another embodiment provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-28, b) a polypeptide comprising a 

35 naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
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amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28. 
5 Still yet another embodiment provides an isolated polynucleotide selected from the group 

consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:29-56, b) a polynucleotide comprising a naturally occurring polynucleotide 
sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 29-56, c) a polynucleotide coniqplementary to the 

10 polynucleotide of a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA 
equivalent of a)-d). In other embodiments, the polynucleotide can comprise at least about 20, 30, 40, 
60, 80, or 100 contiguous nucleotides. 

Yet another embodiment provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising 

15 a polynucleotide sequence selected jfrom the group consisting of SEQ ID NO:29-56, b) a 

polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:29-56, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 

20 comprises a) hybridizing the sample v^th a probe comprising at least 20 contiguous nucleotides 
comprismg a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and b) 
detecting the presence or absence of said hybridization complex. In a related embodiment, the 

25 method can include detecting the amount of the hybridization complex. In still other embodiments, 
the probe can comprise at least about 20, 30, 40, 60, 80, or 100 contiguous nucleotides. 

Still yet another embodiment provides ia method for detecting a target polynucleotide in a 
sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:29-56, b) a 

30 polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:29-56, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amphfying said target polynucleotide or fragment thereof using polymerase chain 

35 reaction amplification, and b) detecting the presence or absence of said amplified target 
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polynucleotide or fragment thereof. In a related embodiment, the method cao include detecting the 
amount of the ampliiSed target polynucleotide or fragment thereof. 

Another embodiment provides a composition comprising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
5 sequence selected from the group consisting of SEQ ID NO: 1-28, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 

10 selected from the group consisting of SEQ ID NO: 1-28, and a pharmaceutically acceptable excipient. 
In one embodiment, the composition can comprise an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-28. Other embodiments provide a method of treating a disease or 
condition associated with decreased or abnormal expression of functional PMOD, comprising 
administering to a patient in need of such treatment the composition. 

15 Yet another embodiment provides a method for screening a compound for effectiveness as an 

agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-28, b) a polypeptide comprising a 
' naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, c) a biologically active 

20 fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28. The method comprises a) exposing a sample 
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another 
embodiment provides a composition comprising an agonist compound identified by the method and a 

25 pharmaceutically acceptable excipient. Yet another embodiment provides a method of treating a 
disease or condition associated with decreased expression of functional PMOD, comprising 
administering to a patient in need of such treatment the composition. 

Still yet another embodiment provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 

30 amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-28, and d) an immunogenic fragment of a polypeptide having an amino 

35 acid sequence selected from the group consisting of SEQ ID NO: 1-28. The method comprises a) 
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exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 
the sample. Another embodiment provides a composition comprising an antagonist compound 
identified by the method and a pharmaceutically acceptable excipient. Yet another embodiment 
provides a method of treating a disease or condition associated with overexpression of functional 
5 PMOD, comprising administering to a patient in need of such treatment the composition. 

Another embodiment provides a method of screening for a compound that specifically binds 
to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-28, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 

10 amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 

15 polypeptide to the test compound, thereby identifymg a compound that specifically binds to the 
polypeptide. 

Yet another embodiment provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consistmg of SEQ ID NO: 1-28, b) a polypeptide comprising a 

20 naturally occurring an^no acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-28, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-28, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ED NO: 1-28. The method comprises a) combining the 

25 polypeptide with at least one test compound under conditions permissive for the activity of the 

polypeptide, b) assessing the activity of the polypeptide in the presence of the test compound, and c) 
comparing the activity of the polypeptide in the presence of the test compound with the activity of the 
polypeptide in the absence of the test compound, wherein a change in the activity of the polypeptide 
in the presence of the test compound is indicative of a compound that modulates the activity of the 

30 polypeptide. 

Still yet another embodiment provides a method for screening a compound for effectiveness 
in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:29-56, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
35 altered expression of the target polynucleotide, and c) comparing the expression of the target 
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polynucieotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

Another embodiment provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compoimd; 
5 b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:29-56, ii) a 
polynucleotide comprising a naturally occunring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 

10 NO:29-56, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide 

complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs 
under conditions whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide selected from the group consisting 
of i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 

15 SEQ ID NO:29-56, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical or at least about 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:29-56, iii) a polynucleotide complementary to the polynucleotide of 
i), iv) a polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)- 
iv). Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected 

20 from the group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) 
comparing the amount of hybridization complex in the treated biological sample with the amount of *^ 
hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 

25 BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for full length polynucleotide and polypeptide 
embodiments of the invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog, and the PROTEOME database identification numbers and annotations of PROTEOME 
30 database homologs, for polypeptide embodiments of the invention. The probability scores for the 
matches between each polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide embodiments, including predicted motifs 
and domains, along with the methods, algorithms, and searchable databases used for analysis of the 
polypeptides. 
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Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide embodiments, along with selected fragments of the polynucleotides. 

Table 5 shows representative cDNA libraries for polynucleotide embodiments. 

Table 6 provides an appendix which describes the tissues and vectors used for construction of 
5 the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
polypeptides, along with applicable descriptions, references, and threshold parameters. 

Table 8 shows single nucleotide polymorphisms found in polynucleotide embodhnents, along 
with allele frequencies in different human populations. 

10 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the invention are not limited to the particular machines, instruments, materials, and 
methods described, as these may vary. It is also to be understood that the terminology used herein is 
15 for the purpose of describmg particular.embodiments only, and is not intended to limit the scope of 
the invention. 

As used herein and in the appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the* context clearly dictates otherwise. Thus, for example, a reference to "a 
host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one 

20 or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
mearungs as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 

25 described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with various embodiments of the invention. Nothing herein is to be construed 
as an admission that the invention is not entitled to antedate such disclosure by virtue of prior 
invention. 

30 DEFINITIONS 

"PMOD" refers to the amino acid sequences of substantially purified PMOD obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
35 PMOD. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
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compound or composition which modulates the activity of PMOD either by directly interacting with 
PMOD or by acting on components of the biological pathway in which PMOD participates. 

An "allelic variant" is an alternative form of the gene encoding PMOD. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
5 polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

10 "Altered" nucleic acid sequences encoding PMOD include those sequences with deletions, 

insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as PMOD or a 
polypeptide with at least one functional characteristic of PMOD. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding PMOD, and improper or unexpected hybridization to allelic variants, 

15 with a locus other than the normal chromosomal locus for the polynucleotide encoding PMOD. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 
aimno acid residues which produce a silent change and result in a functionally equivalent PMOD. 
Deliberate amino acid substitutions may be made on the basis of one or more similarities in polarity, 
charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as 

20 long as the biological or immunological activity of PMOD is retained. For example, negatively 

charged amino acids nniay include aspartic acid and glutamic acid, and positively charged amino acids 
may include lysine and arginine. Amino acids with uncharged polar side chains having similar 
hydrophilicity values may include: asparagine and glutanune; and serine and threonine. Amino acids 
with uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and 

25 valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid** and "amino acid sequence" can refer to an oligopeptide, a peptide, a 
polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino 

30 acid sequence to the complete native amino acid sequence associated with the recited protein 
molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid. 
Amplification may be carried out using polymerase chain reaction (PGR) technologies or other 
nucleic acid amplification technologies well known in the art. 
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The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of PMOD. Antagonists may include proteins such as antibodies, anticalins, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the activity 
of PMOD either by directly interacting with PMOD or by acting on components of the biological 
5 pathway in which PMOD participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind PMOD polypeptides can be prepared using intact polypeptides or using 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
10 oligopeptide used to immunize an ammal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
the animal. 

15 The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 

makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 

20 used to elicit the immune response) for bmding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. ' 
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 

25 Aptamer compositions may be double-stranded or smgle-stranded, and may include 

deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2'-0H group of a 
ribonucleotide may be replaced by 2'-F or 2'-NH2), which may improve a desired property, e.g., 
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 

30 e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.) 

The term "intramer" refers to an aptamer which is expressed in vivo. For example, a vaccinia 
virus-based RNA expression system has been used to express specific RNA aptamers at high levels in 

35 the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl. Acad. Sci. USA 96:3606-3610). 
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The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which nonnally act on 
substrates containing right-handed nucleotides. 
5 The term "antisense" refers to any composition capable of base-pairing with the "sense" 

(coding) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions 
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone 
linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides 
having modified sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or ' 

10 oligonucleotides having modified bases such as 5-methyl cytosine, 2 -deoxyuracil, or 7-deaza-2" 

deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis 
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a 
naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either 
transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, 

15 and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic PMOD, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals'or cells and to bind with specific 

20 antibodies. 

"Complementary" describes the relationship between two smgle-stranded nucleic acid ' ^ 
sequences that anneal by base-pairmg. For example, 5 -AGT-S* pairs with its complement, 
3'-TCA-5'. 

A "composition comprising a given polynucleotide" and a "composition con^rising a given 
25 polypeptide" can refer to any composition containing the given polynucleotide or polypeptide. The 
composition may comprise a. dry formulation or an aqueous solution. Compositions comprising 
polynucleotides encoding PMOD or fragments of PMOD may be employed as hybridization probes. 
The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as 
a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts 
30 (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's 
solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
35 assembled from one or more overiapping cDNA, EST, or genomic DNA fragments using a computer 
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program for fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison 
WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both extended and 
assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least. 
5 interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 





as conservative amino acid substitutions. 

Original Residue 


Conservative Substitution 


10 


Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 


15 


Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




De 


Leu, Val ' 


20 


Leu 


He, Val 




Lys 


Arg, Gbi, Glu 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 


25 


Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


He. Leu, Thr 



30 Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 

backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
. the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
35 absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A derivative 
40 polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was derived. 
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A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
5 comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
axon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
10 evolution of new protein functions. 

A "fragment" is a unique portion of PMOD or a polynucleotide encoding PMOD which can 
be identical in sequence to, but shorter in length than, the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino acid 
15 residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, 
may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous 
nucleotides or amino acid residues in length. Fragments may be preferentially selected from certain 
regions of a molecule. For example, a polypeptide fragment may con^rise a certain length of 
contiguous amino acids selected from the fnsi 250 or 500 amino acids (or first 25% or 50%) of a 
20 polypeptide as shown in a certain defined sequence. Clearly these lengths are exemplary, and any 
length that is supported by the specification, including the Sequence Listing, tables, and figures, may 
be encompassed by the present embodiments. 

A fragment of SEQ ID NO:29-56 can comprise a region of unique polynucleotide sequence 
that specifically identifies SEQ ID NO:29-56, for example, as distinct from any other sequence in the 
25 genome from which the fragment was obtained. A fragment of SEQ ID NO:29-56 can be employed 
in one or more embodiments of methods of the invention, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:29-56 from related 
polynucleotides. The precise length of a fragment of SEQ ID NO:29-56 and the region of SEQ ID 
NO;29-56 to which the fragment corresponds are routinely determinable by one of ordinary skill in 
30 the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-28 is encoded by a fragment of SEQ ID NO:29-56. A fragment 
> of SEQ ID NO: 1-28 can comprise a region of unique amino acid sequence that specifically identifies 
SEQ ID NO: 1-28. For example, a fragment of SEQ ID NO: 1-28 can be used as an inmiunogenic 
peptide for the development of antibodies that specifically recognize SEQ ID NO: 1-28. The precise 
35 length of a fragment of SEQ ID NO: 1-28 and the region of SEQ ID NO: 1-28 to which the fragment 
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corresponds can be determined based on the intended purpose for the fragment using one or more 
analytical methods described herein or otherwise known in the art. 

A "full length" polynucleotide is one containing at least a translation initiation codon (e.g., 
methionine) followed by an open reading frame and a translation termination codon. A "full length" 
5 polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence sincdlarity or, interchangeably, sequence identity, between 
two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 

10 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using one or more 
computer algorithms or programs known in the art or described herein. For example, percent identity 

15 can be detennined using the default parameters of the CLUSTAL V algorithm as incorporated into 
the MEGALIGN version 3. 12e sequence alignment program. This program is part of the 
LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, 
Madison WI). CLUSTAL V is described in Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151- 
153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For paurwise alignments of 

20 polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, 

window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default. 
Percent identity is reported by CLUSTAL V as the "percent similarity" between aligned 
polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 

25 which cian be used is provided by the National Center for Biotechnology Information (NCBI) Basic 
Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), 
which is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 

30 other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 

35 compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
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2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
5 Open Gap: 5 and Extension Gap: 2 penalties 

Gap X drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 

10 Percent identity may be measured over the length of an entire defined sequence, for example, 

as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 

15 supported by the sequences shown herein, in the tables, figures, or Sequence Listing, naay be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 

20 sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 

25 substitutions, explained in more detail above, generally preserve the charge and_hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 

30 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved' =5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

35 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
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2.0. 12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extemion Gap: 1 penalties 
5 Gap X drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 

10 for exaniple, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
exancple, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fi-agment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths ate exemplary only, and it is understood that any fi-agment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 

15 used to describe a length over which percentage identity may be measured. 

'Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term *liumanized antibody" refers to an antibody molecule in which the amino acid 

20 sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 

25 Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in detennining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 

30 in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 jUg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

35 under which the wash step is carried out. Such wash tejoaperatures are typically selected to be about 
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S^'C to 20*^0 lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al. 

5 (1989) Molecular Cloning: A Laboratory Manual 2"^ ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 

10 concentration may be varied from about 0. 1 to 2 x SSC, with SDS being present at about 0. 1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 /ig/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 
circumstances, such as for RNAiDNA hybridizations. Useful variations on these wash conditions 

15 will be readily apparent to those of ordmary skill in the art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a conq)lex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex 

20 may be formed in solution (e.g.. Cot or Rot analysis) or formed between one nucleic acid present in 
solution and another nucleic acid immobilized on a solid support (e.g., paper, membranes, filters, 
chips, pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have 
been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or polynucleotide 
25 sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 
30 An "immunogenic fragment" is a polypeptide or oligopeptide fragment of PMOD which is 

capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide firagment 
of PMOD which is useful in any of the antibody production methods disclosed herein or known in the 
art. 

35 The term "microarray" refers to an arrangement of a plurality of polynucleotides. 
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polypeptides, antibodies, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or 
other chemical compound having a unique and defined position on a microaixay. 

The term "modulate" refers to a change in the activity of PMOD. For example, modulation 
5 may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of PMOD. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origui which may be smgle-stranded or double-stranded and naay represent the sense or the 
10 antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence.' Operably linked DNA sequences may be m close proximity or contiguous and, where 
15 necessary to join two protein coding regions, in the same reading frame. 

'Teptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysme. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
20 elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an PMOD may involve lipidation, glycosylation, 

phosphorylation, acetylation, racenaization, proteolytic cleavage, and other modifications known in 

the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
« 

vary by cell type depending on the enzymatic milieu of PMOD. 

25 "Probe" refers to nucleic acids encoding PMOD, their complements, or fragments thereof, 

wliich are used to detect identical, allelic or related nucleic acids. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide 

30 by complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. 

Primer pairs can be used for amplification (and identification) of a nucleic acid, e.g., by the 
polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
35 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
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be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 
5 Methods for preparing and using probes and primers are described in the references, for 

example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual , 2"^ ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Lmis, M. et al. (1990) PCR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer pairs 
10 can be derived from a known sequence, for exanq>le, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

15 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides fi-om an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primei: selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers firom 

20 megabase sequences and is thus useful for designing primers on a genome-wide scope. The PrimerS 
primer selection program (available to the public from the Whitehead Institute/MTT Center for 
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which 
sequences to avoid as primer binding sites are user-specified. PrimerS is useful, in particular, for the 
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection 

25 programs may also be obtained firom their respective sources and modified to meet the user's specific 
needs.) The PrimeGen program (available to the public firom the UK Human Genome Mapping 
Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 

30 unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 

polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 
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A "recombinant nucleic acid" is a nucleic acid that is not naturally occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or, more 
commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic 

5 engineering techniques such as those described in Sambrook, supra. The term recombinant includes 
nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the 
nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably 
linked to a promoter sequence. Such a recombinant nucleic acid may be part of a vector that is used, 
for example, to transform a cell. 

10 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from imtranslated 
regions of a gene and includes enhancers, promoters, introns, and 5* and 3* untranslated regions 

15 CUTRs). Regdatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 

20 other moieties known in the art. 

An "RNA equivalent," m reference to a DNA molecule, is composed of the same linear 
sequence of nucleotides as the reference DNA molecule with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

25 The term "sample" is used in its broadest sense. A sample suspected of containing PMOD, 

nucleic acids encoding PMOD, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 

30 protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 

35 and the antibody will reduce the amount of labeled A that binds to the antibody . / 
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The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% free, 
preferably at least about 75% free, and most preferably at least about 90% free from other 
components with w^hich they are naturally associated. 
5 A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 

by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubmg, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 

10 trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 
expression by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 

15 well known in the art, and may rely on any knowm method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed cells" includes stably transformed cells in which the inserted DNA is capable of 

20 replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transiently transformed cells which express the inserted DNA or RNA for linaited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 

25 art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. In another embodiment, the nucleic acid can be introduced by infection with a 
recombinant viral vector, such as a lenti viral vector (Lois, C. et al. (2002) Science 295:868-872). The 
term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 

30 rather is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 
known in the art, for example infection, transfection, transformation or transconjugation. Techniques 
for transferring the DNA of the present invention into such organisms are widely known and provided 

35 in references such as Sambrook et al. (1989), supra. 
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A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
5 least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allehc" (as defined above), "splice," "species," or "polymorphic" variant A splice variant may have 
significant identity to a reference molecule, but wiU generally have a greater or lesser number of 

10 polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotides that vary jfrom one species to another. The 
resulting polypeptides will generally have significant anrino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 

15 individuals of a given species. Polymorphic variants also may encompass "single nucleotide 

polymorphisms" XSNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 

20 at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May«-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 

25 identity over a certain defined length of one of the polypeptides. 

THE INVENTION 

Various embodiments of the invention include new human protein modification and 
maintenance molecules (PMOD), the polynucleotides encoding PMOD, and the use of tiiese 
30 compositions for the diagnosis, treatment, or prevention of gastrointestinal, cardiovascular, 
autoimmune/inflammatory, cell proliferative, developmental, epithelial, neurological, and 
reproductive disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated 
35 to a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
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denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
bicyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 
sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as 
shown. Column 6 shows the Incyte ID numbers of physical, full length clones corresponding to 
polypeptide and 

polynucleotide embodiments. The full length clones encode polypeptides which have at least 95% 
sequence identity to the polypeptides shown in column 3. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database and the PROTEOME database. 
Colunms 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ ID NO:) 
and die corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for polypeptides 
of the invention. Column 3 shows the GenBank identification number (GenBank ID NO:) of the 
nearest GenBank homolog and the PROTEOME database identification numbers (PROTEOME ID 
NO:) of the nearest PROTEOME database homologs. Column 4 shows the probability scores for the 
matches between each polypeptide and its homolog(s). Column 5 shows the annotation of the 
GenBank and PROTEOME database homolog(s) along with relevant citations where applicable, all of 
which are expressly incorporated by reference herem. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the 
MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group, 
Madison WI). Column 6 shows anciino acid residues comprising signature sequences, domains, and 
motifs. Column 7 shows analytical methods for protein structure/function analysis and in some cases, 
searchable databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are protein modification and maintenance 
molecules. For example, SEQ ID NO: 1 is 43% identical, from residue K223 to residue A774, to 
Arabidopsis thaliana ubiquitin-protein ligase 1 (GenBank DD g7108521) as determined by the Basic 
Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 1.3e-85, 
which indicates the probability of obtaining the observed polypeptide sequence alignment by chance. 
SEQ ID NO: 1 also contains a HECT (ubiquitin-transferase) domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
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conserved protein family domains. (See Table 3.) Data from BLIMPS and BLAST analyses provide 
further corroborative evidence that SEQ ID NO: 1 is a ubiquitin-protein ligase. 

As another example, SEQ ID N0:5 is 38% identical, from residue E22 to residue K368, to 
Arabidopsis thaliana ubiquitin-specific protease 26 (GenBank ID gl 1993492) as determined by the 

5 Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is Lie- 
79, which indicates the probability of obtaining the observed polypeptide sequence alignment by 
chance. SEQ ID NO:5 also contains a ubiquitin carboxl-terminal hydrolases 1 domain and a ubiquitin 
carboxl-terminal hydrolases 2 domain as determined by searching for statistically significant matches 
in the hidden Markov model (HMM)-based PFAM database of conserved protein family donmins. 

10 (See Table 3.) Data from BLIMPS, MOTIFS, and additional BLAST analyses provide further 
corroborative evidence that SEQ ID NO:5 is a ubiquitin carboxyl terminal hydrolase. 

As another example, SEQ ID N0:7 is 91% identical, fromresidue P23 to residue S531, to a 
human carboxypeptidase N (GenBank ID gl79936) as determined by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 1.7e-235, v/hich indicates the 

15 probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID N0:7 also 
contains leucine-rich repeat domains as determined by searching for statistically significant matches 
in the hidden Markov model (HMM)-based PFAM database of conserved protein family domains. 
(See Table 3.) Data from MOTIFS analysis provides further corroborative evidence that SEQ ID 
N0:7 is a carboxypeptidase. 

20 As another example, SEQ ID NO: 10 is 46% identical, from residue R8 to residue S143, to 

mouse testatin, which is related to the cysteine protease inhibitors, cystatins (GenBank ID g3928491) 
as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 1.4e-27, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO: 10 also contains a cystatin domain as determined by 

25 searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM 
database of conserved protein family domains. (See Table 3.) Data from further BLAST analysis 
provides corroborative evidence that SEQ ID NO: 10 is a cysteine protease inhibitor. 

As another example, SEQ ID NO:20 is 99% identical, from residue M17 to residue K267, to 
human kallikrein 14 (GenBank ID gl3897995) as determined by the Basic Local Alignment Search 

30 Tool (BLAST). (See Table 2.) The BLAST probability score is 2. le-136, which indicates the 
probability of obtaining the observed polypeptide sequence aUgnment by chance. SEQ ID NO:20 
also contains a trypsin domain as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 
Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative 

35 evidence that'SEQ ID NO:20 is a serine protease. 
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As another example, SEQ ID NO:27 is 96% identical, from residue Ml to residue T242, to 
human putative mast cell mMCP-7-like II tryptase (GenBank ID g4336577) as determined by the 
Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 1.8e- 
130, which indicates the probability of obtaining the observed polypeptide sequence alignment by 
5 chance. SEQ ID NO:27 also contains a trypsin domain as determined ]by searching for statistically 
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses 
provide further coiroborative evidence that SEQ ID NO:27 is a trypsin-hke serine protease. 

SEQ ID NO:2-4, SEQ ID N0:6, SEQ ID NO:8-9, SEQ ID NO: 11-19, SEQ ID NO:21-26 and 
10 SEQ ID NO:28 were analyzed and annotated in a similar manner. The algorithms and parameters for 
the analysis of SEQ ID NO: 1-28 are described in Table 7. 

As shown in Table 4, the full length polynucleotide embodiments were assembled using 
cDNA sequences or coding (exon) sequences derived from genomic DNA, or any combination of 
these two types of sequences. Column 1 lists the polynucleotide sequence identification number 
15 (Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number 
(Incyte ID) for each polynucleotide of the invention, and the length of each polynucleotide sequence 
in basepairs. Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or 
genomic sequences used to assemble the full length polynucleotide embodiments, and of fragments of 
die polynucleotides which are useful, for example, in hybridization or amplification technologies that 
20 identify SEQ ID NO:29-56 or that distinguish between SEQ ID NO:29-56 and related 
polynucleotides. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specificaUy , for 
example, to Licyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 
libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank 

25 cDNAs or ESTs which contributed to the assembly of the full length polynucleotides. In addition, the 
polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL 
(The Sanger Centre, Cambridge, UK) database (z.e., those sequences including the designation 
'*ENST"). Alternatively, the polynucleotide fragments described in column 2 may be derived from 
the NCBI RefSeq Nucleotide Sequence Records Database (Le., those sequences including the 

30 designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (Le. , those sequences 
including the designation "NP"). Alternatively, the polynucleotide fragments described in column 2 
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon 
stitching" algorithm. For example, a polynucleotide sequence identified as 
FL_XXXXXX_Nj_N2_YYYYY_,N3_N4 represents a "stitched" sequence in which XXXXXX is the 

35 identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is 
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the number of the prediction generated by the algorithm, and Nj 2,3.,.^ if present, represent specific 
exons that may have been manually edited during analysis (See Example V). Alternatively, the 
polynucleotide fragments in colunnn 2 may refer to assemblages of exons brought together by an 
"exon-stretching" algorithm. For example, a polynucleotide sequence identified as . 

5 V\JiXXXXK_sAAAAA^BBBBBJ._N is a "stretched" sequence, with XXXXXX being the Incyte 
project identification number, gAi4AAA being the GenBank identification number of the human 
genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 
protem homolog, and N referring to specific exons (See ExMiple V). In instances where a RefSeq 

10 sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier 
(denoted by "NM,'* 'TSDP," or "NT") may be used in place of the GenBank identifier (/.e., gBBBBB), 

Altematively, a prefix identifies component sequences that were hand^edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists exan:q)les of component sequence prefixes and corresponding sequence analysis 

15 methods associated with the prefixes (see Example IV and Example V). 



PrejRx 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


EL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in Table 
4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Licyte cDNA 
25 identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotides which 
were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte cDNA 
library which is most frequently represented by the Incyte cDNA sequences which were used to 
assemble and confirm the above polynucleotides. The tissues and vectors which were used to 
30 construct the cDNA libraries shown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide 
embodiments, along with allele frequencies in different human populations. Columns 1 and 2 show 
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the polynucleotide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
project identification number (PE)) for polynucleotides of the invention. Column 3 shows the Incyte 
identification number for the EST in which the SNP was detected (EST ID), and column 4 shows the 
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence 
5 at which the SNP is located (EST SNP), and column 6 shows the position of the SNP within the full- 
length polynucleotide sequence (CBl SNP). Column 7 shows the allele found in the EST sequence. 
Columns 8 and 9 show the two alleles found at the SNP site. Column 10 shows the amino acid 
encoded by the codon including the SNP site, based upon the allele found in the EST. Columns 1 1- 
14 show the firequency of allele 1 in four different human populations. An entry of n/d (not detected) 

10 indicates that the frequency of allele 1 in the population was too low to be detected, while n/a (not 
available) indicates that the allele frequency was not determined for the population . 

The invention also encompasses PMOD variants. A preferred PMOD variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the PMOD amino acid sequence, and which contains at least one functional or 

15 structural characteristic of PMOD. 

Various embodunents also encompass polynucleotides which encode PMOD. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
firom the group consisting of SEQ ID NO:29-56, which encodes PMOD. The polynucleotide 
sequences of SEQ ID NO: 29-56. as presented in the Sequence Listing, embrace the equivalent RNA 

20 sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses variants of a polynucleotide encoding PMOD. In particular, 
such a variant polynucleotide will have at least about 70%, or alternatively at least about 85%, or 
even at least about 95% polynucleotide sequence identity to a polynucleotide encoding PMOD. A 

25 particular aspect of the invention encompasses a variant of a polynucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:29-56 which has at least about 70%, or 
alternatively at least about 85%, or even at least about 95% polynucleotide sequence identity to a 
nucleic acid sequence selected from the group consisting of SEQ ID NO:29-56. Any one of the 
polynucleotide variants described above can encode a polypeptide which contains at least one 

30 functional or structural characteristic of PMOD. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant 
of a polynucleotide encoding PMOD. A splice variant may have portions which have significant 
sequence identity to a polynucleotide encoding PMOD, but will generally have a greater or lesser 
number of polynucleotides due to additions or deletions of blocks of sequence arising from alternate 

35 splicing of exons durinjg mRNA processing. A splice variant may have less than about 70%, or 
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alternatively less than about 60%, or alternatively less than about 50% polynucleotide sequence 
identity to a polynucleotide encoding PMOD over its entire length; however, portions of the splice 
variant will have at least about 70%, or alternatively at least about 85%, or alternatively at least about 
95%, or alternatively 100% polynucleotide sequence identity to portions of the polynucleotide 
5 encoding PMOD. For exaniple, a polynucleotide comprising a sequence of SEQ ID N0:31 is a splice 
variant of a polynucleotide comprising a sequence of SEQ ID NO: 34, and a polynucleotide 
comprising a sequence of SEQ ID NO:44 is a splice variant of a polynucleotide comprising a 
sequence of SEQ ID NO:56. Any one of the splice variants described above can encode a 
polypeptide which contains at least one functional or structural characteristic of PMOD. 

10 It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

genetic code, a multitude of polynucleotide sequences encoding PMOD, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 

15 combinations are made in accordance with the standard triplet genetic code as applied to the 

polynucleotide sequence of naturally occurring PMOD, and all such variations are to be considered as 
being specifically disclosed. 

Although polynucleotides which encode PMOD and its variants are generally capable of 
hybridizing to polynucleotides encoding naturally occurring PMOD under appropriately selected 

20 conditions of stringency, it may be advantageous to produce polynucleotides encodmg PMOD or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular 
codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence 

25 encoding PMOD and its derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of polynucleotides which encode PMOD and 
PMOD derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 

30 synthetic polynucleotide may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a polynucleotide encoding PMOD or any fragment thereof. 

Embodiments of the invention can also include polynucleotides that are capable of 
hybridizing to the claimed polynucleotides, and, in particular, to those having the sequences shown in 

35 SEQ ID NO:29-56 and fragments thereof, under various conditions of stringency. (See, e.g., Wahl, 
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GM. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods 
Enzymol. 152:507-511.) Hybridization conditions, including annealing and wash conditions, are 
described in "Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
5 the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations 
of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification 
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines such 

10 as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler (MJ 
Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems). 
Sequencuig is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied 
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other 
systems known in the art. The resulting sequences are analyzed using a variety of algorithms which 

15 are well known in the art. (See, e.g., Ausubel, RM. (1997) Short Protocols in Molecular Biology , 
John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and 
Biotechnology . WUey VCH. New York NY, pp. 856-853.) 

The nucleic acids encoding PMOD may be extended utilizing a partial nucleotide sequence 
and en5)loying various PCR-based methods known in the art to detect upstream sequences, such as 

20. promoters and regulatory elements. For example, one method which may be employed, 

restriction-site PGR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, mverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 

25 a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR an^lification of DNA fragments 
adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, 
M. et al. (1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme 
digestions and ligations may be used to insert an engineered double-stranded sequence into a region 

30 of unknown sequence before performing PCR. Other methods which may be used to retrieve 
unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 
19:3055-3060). Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries 
(Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries 
and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed 

35 using commercially available software, such as OLIGO 4.06 primer analysis software (National 
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Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72^C 

When screening for full length cDNAs, it is preferable to use libraries that have been 

5 size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 

10 the size or confirm the nucleotide sequence of sequencing or PGR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal usmg appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 

15 process from loading of samples to conq)uter analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in lunited amounts in a particular sample. 

Iq another embodiment of the invention, polynucleotides or fragments thereof which encode 
PMOD may be cloned in recombinant DNA molecules that direct expression of PMOD, or fragments 

20 or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other polynucleotides which encode substantially the same or a functionally equivalent 
polypeptides may be produced and used to express PMOD. 

The polynucleotides of the invention can be engineered using methods generally known in the 
art in order to alter PMOD-encoding sequences for a variety of purposes including, but not limited to, 

25 modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by 
random fragmentation and PGR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed 
mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, and so forth. 

30 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C.-C. et al (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
BiotechnoL 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of PMOD, such as its biological or enzymatic activity or its ability 

35 to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
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variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
5 breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point niutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
10 manner. 

In another embodiment, polynucleotides encoding PMOD may be synthesized, in whole or in 
part, using one or more chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. 
(1980) Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 
7:225-232.) Altematively, PMOD itself or a fragment thereof may be synthesized using chemical 

15 mediods known in the art. For example, peptide synthesis can be performed using various solution- 
phase or solid-phase techniques. (See, e.g., Creighton, T. (1984) Ptoteins, Structures and Molecular 
Properties . WH Freeman, New York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 
269:202-204.) Automated synthesis may be achieved using the ABI 431 A peptide synthesizer 
(Applied Biosystems). Additionally, the amino acid sequence of PMOD, or any part thereof, may be 

20 altered during direct synthesis and/or combined with sequences from other proteins, or any part 

thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally occurring 
polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and RZ. Regnier (1990) Methods Enzymol. 182:392-421.) 

25 The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active PMOD, the polynucleotides encoding PMOD or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in 

30 a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding 
PMOD. Such elements may vary in their strength and specificity. Specific initiation signals may also 
be used to achieve more efficient translation of polynucleotides encoding PMOD. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where a 

35 polynucleotide sequence encoding PMOD and its initiation codon and upstream regulatory sequences 
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axe inserted into the appropriate expression vector, no additional transcriptional or translational 
control signals may be needed. However, in cases where only coding sequence, or a fragment 
thereof, is inserted, exogenous translational control signals including an in-frame ATG initiation 
codon should be provided by the vector. Exogenous translational elements and initiation codons may 

5 be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by 
the inclusion of enhancers appropriate for the particular host cell system used. (See, e.g., Scharf, D. 
et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing polynucleotides encoding PMOD and appropriate transcriptional and translational 

10 control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A 
Laboratorv Manual , Cold Spring Harbor iPress, Plainview NY, ch. 4, 8, and 16-17; Ausubel, P.M. et 
al. (1995) C:urrent Protocols in Molecular Biologv . John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

15 A variety of expression vector/host systems may be utilized to contain and express 

polynucleotides encoding PMOD. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovkus); plant cell systems transformed with viral expression vectors (e.g., 

20 cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra; Ausubel, supra; 
Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) 
Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; 
Takamatsu, N. (1987) EMBO J. 6:307-31 1; The McGraw Hill Yearbook of Science and Technology 

25 (1992) McGraw Hill, New York NY, pp. 19M96; Logan, J. and T. Shenk (1984) Proc. Natl. Acad, 
Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression 
vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various 
bacterial plasmids, may be used for delivery of polynucleotides to the targeted organ, tissue, or cell 
population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen, Ther. 5(6):350-356; Yu, M. et al. 

30 (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; BuUer, R.M, et al. (1985) Nature 

317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. 
and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotides encoding PMOD. For example, routine cloning, 

35 subcloning, and propagation of polynucleotides encoding PMOD can be achieved using a 
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multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La JoUa CA) or PSPORTl 
plasmid (Invitrogen). Ligation of polynucleotides encoding PMOD into the vector's multiple cloning 
site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
5 in vitro transcription, dideoxy sequencing, single strand rescue v^ith helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. 
Chem. 264:5503-5509.) When large quantities of PMOD are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of PMOD may be used. For example, vectors 
containing the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

10 Yeast expression systems may be used for production of PMOD. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign polynucleotide sequences into the host genome for stable propagation. (See, 

15 e.g., Ausubel, 1995, supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, 
CA. et al. (1994) Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of PMOD. Transcription of polynucleotides 
encoding PMOD may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 

20 6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 

promoters may be used. (See, e.g.. Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al.~ 
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 
pathogen-mediated transfection. (See, e.g.. The McGraw Hill Yearbook of Science and Teclmology 

25 (1992) McGraw Hill, New York NY, pp. 191-196.) 

hi mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, polynucleotides encoding PMOD may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 

30 obtain infective virus which expresses PMOD in host cells. (See, e.g., Logan, J. and T. Shenk (1984) 
Proc. Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous 
sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 
. or EBV"based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

35 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
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constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression 

5 of PMOD in cell lines is preferred. For example, polynucleotides encoding PMOD can be 

transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. FoUov^ing the introduction of the vector, cells may be allowed to grow for about 1 to 2 days 
in enriched media before being switched to selective media. The purpose of the selectable marker is 

10 to confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any mmiber of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 

15 phosphoribosyltransferase genes, for use in tk' and apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, 1. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfiiron and phosphinotricin acetyltransferase, respectively. (See, e.g., 

20 Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), B glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 

25 luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 

30 sequence encoding PMOD is inserted within a marker gene sequence, transformed cells containing 
polynucleotides encoding PMOD can be identified by the absence of marker gene function. . 
Alternatively, a marker gene can be placed in tandem with a sequence encoding PMOD under the 
control of a single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well. 

35 In general, host cells that contain the polynucleotide encoding PMOD and that express 
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PMOD may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PGR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

5 hnmunological methods for detecting and measuring the expression of PMOD using either 

specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on PMOD is preferred, but a 

10 competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual , APS Press, St Paul MN, 
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Lnmunologv , Greene Pub. Associates and 
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana 
Press, TotowaNJ.) 

15 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding PMOD 
include oligolabeling, nick translation, end-labeling, or PGR amplification using a labeled nucleotide. 
Alternatively, polynucleotides encoding PMOD, or any fragments thereof, may be cloned into a 

20 vector for the production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro hy addition of an appropriate RNA 
polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted 
using a variety of commercially available kits, such as those provided by Amersham Biosciences, 
Promega (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be 

25 used for ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or 
chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with polynucleotides encoding PMOD may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 

30 and/or the vector used. As will be understood by those of skill in the art, expression vectors 

containing polynucleotides which encode PMOD may be designed to contain signal sequences which 
direct secretion of PMOD through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted polynucleotides. or to process the expressed protein in the desired fashion. Such 

35 modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
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glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves 
a "prepro" or "pro" form of the protein may also be used to specify protein targeting, folding, and/or 
activity. Different host cells which have specific cellular machinery and characteristic mechanisms 
for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WD8) are available from the 

5 American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant polynucleotides 
encoding PMOD may be ligated to a heterologous sequence resulting m translation of a fusion protein 
in any of the aforementioned host systems. For example, a chimeric PMOD protein containmg a 

10 heterologous moiety that can be recognized by a coiiraiercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of PMOD activity. Heterologous protein and peptide 
moieties may also facilitate purification of fusion proteins using commercially available aflBnity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
bmding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, 

15 and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate 
resins, respectively. FLAG, c-myc, and hemagglutmin (HA) enable immunoaffinity purification of 
fusion proteins using commercially available monoclonal and polyclonal antibodies that specifically 
recognize these epitope tags. A fusion protein may also be engineered to contain a proteolytic 

20 cleavage site located between the PMOD encoding sequence and the heterologous protein sequence, 
so that PMOD may be cleaved away from the heterologous moiety following purification. Methods 
for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch. 10). A 
variety of conmiercially available kits may also be used to facilitate expression and purification of 
fusion proteins. 

25 In another embodiment, synthesis of radiolabeled PMOD may be achieved in vitro using the 

TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple 
transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 
promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, ^^S-methionine. 

30 PMOD, fragments of PMOD, or variants of PMOD may be used to screen for compounds 

that specifically bind to PMOD. One or more test compounds may be screened for specific binding 
to PMOD. In various embodiments, 1, 2, 3, 4, 5, 10, 20, 50, 100, or 200 test compounds can be 
screened for specific binding to PMOD. Examples of test compounds can include antibodies, 
anticalins, oUgonucleotides, proteins (e.g., ligands or receptors), or small molecules. 

35 In related embodiments, variants of PMOD can be used to screen for binding of test 
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compounds, such as antibodies, to PMOD, a variant of PMOD, or a combination of PMOD and/or 
one or more variants PMOD. In an embodiment, a variant of PMOD can be used to screen for 
compounds that bind to a variant of PMOD, but not to PMOD having the exact sequence of a 
sequence of SEQ ID NO: 1-28. PMOD variants used to perform such screening can have a range of 
about 50% to about 99% sequence identity to PMOD, with various embodiments having 60%, 70%, 
75%. 80%, 85%, 90%, and 95% sequence identity. 

In an embodiment, a compound identified in a screen for specific binding to PMOD can be 
closely related to the natural ligand of PMOD, e.g., a ligand or fragment thereof, a natural substrate, a 
stractural or functional mimetic, or a natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) 
Current Protocols in Immunology l(2):Chapter 5.) In another embodiment, the compound thus 
identified can be a natural ligand of a receptor PMOD. (See, e.g., Howaid, A.D. et al. (2001) Trends 
Phannacol. Sci.22: 132-140; Wise, A. et al. (2002) Drug Discovery Today 7:235-246.) 

In other embodiments, a compound identified in a screen for specific binding to PMOD can 
be closely related to the natural receptor to which PMOD binds, at least a fiagment of the receptor, or 
15 a fragment of the receptor mcluding aU or a portion of the ligand bmding site or binding pocket. For 
example, the confound may be a receptor for PMOD which is capable of propagating a signal, or a 
decoy receptor for PMOD which is not capable of propagating a signal (Ashkenazi, A. and V.M. 
Divit (1999) Cuir. Opin. Cell Biol. 1 1:255-260; Mantovani, A. et al. (2001) Trends Immunol. 22:328-. 
336). The compound can be rationally designed using known techniques. Examples of such 
techniques uiclude those used to construct the compound etanercept (ENBREL; Immunex Corp., 
Seattle WA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is an 
engineered p75 tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human IgG , 
(Taylor, P.C. et al. (2001) Cuir. Opin. Immunol. 13i61 1-616). 

In one embodiment, two or more antibodies having similar or, alternatively, different 
specificities can be screened for specific binding to PMOD, fragments of PMOD, or variants of 
PMOD. The binding specificity of the antibodies thus screened can thereby be selected to identify 
particular fragments or variants of PMOD. In one embodiment, an antibody can be selected such that 
its binding specificity allows for preferential identification of specific fragments or variants of 
PMOD. In another embodiment, an antibody can be selected such that its binding specificity allows 
for preferential diagnosis of a specific disease or condition having increased, decreased, or otherwise 
abnormal production of PMOD. 

In an embodunent, anticalins can be screened for specific binding to PMOD, fiagments of 
PMOD, or variants of PMOD. Anticalins are ligand-binding proteins that have been constructed 
based on a lipocalin scaffold (Weiss, G.A. and H.B. Lowman (2000) Chem. Biol. 7:R177-R184; 
Skeira, A. (2001) J. Biotechnol. 74:257-275). The protein architecture of lipocalins can include a 
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beta-barrel having eight antiparallel beta-strands, which supports four loops at its open end. These 
loops form the natural ligand-binding site of the lipocalins, a site which can be re-engineered in vitro 
by amino acid substitutions to impart novel binding specificities. The amino acid substitutions can be 
made using methods known in the art or described herein, and can include conservative substitutions 
5 (e.g., substitutions that do not alter binding specificity) or substitutions that modestly, moderately, or 
significantly alter binding specificity. 

In one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit 
PMOD involves producing appropriate cells which express PMOD, either as a secreted protein or on 
the cell membrane. Preferred cells include cells firom mammals, yeast, Drosophila, or E. colt Cells 
10 expressing PMOD or cell membrane fi-actions which contain PMOD are then contacted with a test 
compound and binding, stimulation, or inhibition of activity of either PMOD or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 

15 the assay may comprise the steps of combining at least one test compound with PMOD, either in 
solution or affixed to a solid support, and detectmg the binding of PMOD to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 

20 solid support. 

An assay can be used to assess the ability of a compound to bind to its natural ligand and/or 
to inhibit the binding of its natural ligand to its natural receptors. Examples of such assays include 
radio-labeling assays such as those described in U.S. Patent No. 5,914,236 and U.S. Patent No. 
6,372,724. In a related embodiment, one or more amino acid substitutions can be introduced into a 

25 polypeptide compound (such as a receptor) to improve or alter its ability to bind to its natural ligands. 
(See, e.g., Matthews, DJ. and J.A. Wells. (1994) Chem. Biol. 1:25-30.) In another related 
embodiment, one or more amino acid substitutions can be introduced into a polypeptide compound 
(such as a ligand) to improve or alter its ability to bind to its natural receptors. (See, e.g., 
Cunningham, B.C. and J.A. Wells (1991) Proc. Natl. Acad. Sci. USA 88:3407-3411; Lowman, H.B. 

30 et al. (1991) J. Biol. Chem. 266: 10982-10988.) 

PMOD, fragments of PMOD, or variants of PMOD nfiay be used to screen for compounds 
that modulate the activity of PMOD. Such compoimds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
PMOD activity, wherein PMOD is combined with at least one test compound, and the activity of 

35 PMOD in the presence of a test compound is compared with the activity of PMOD in the absence of 
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the test compound. A change in the activity of PMOD in the presence of the test compound is 
indicative of a compound that modulates the activity of PMOD. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising PMOD under conditions suitable for PMOD 
activity, and the assay is performed. In either of these assays, a test compound which modulates the 
5 activity of PMOD may do so indirectly and need not come in direct contact with the test compound. 
At least one and up to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding PMOD or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 

10 models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. 
(1989) Science 244: 1288-1292). The vector integrates mto the corresponding region of the host 

15 genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. lavest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 

20 to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 

heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding PMOD may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate mto at least eight separate cell 

25 lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282: 1 145-1 147). 

Polynucleotides encoding PMOD can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 

30 region of a polynucleotide encodmg PMOD is injected mto animal ES cells, and the injected 

sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress PMOD, e.g., by secreting PMOD m its milk, may also 

35 serve as a convenient source of that protein (Janne, J. et al. (1998) BiotechnoL Annu. Rev. 4:55-74). 
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THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of PMOD and protein modification and maintenance molecules. La addition, the 
expression of PMOD is closely associated with epithilial, brain, brain tumor, ileum, lymph node, 
liver, ovarian, placental, prostate, cerebellum, pituitary gland, small mtestine, and testis tissues and 
promonocyte cells . Further examples of tissues expressing PMOD can be found in Table 6 and can 
also be found in Example XI. Therefore, PMOD appears to play a role in gastrointestmal, 
cardiovascular, autoimmune/inflainmatory, cell proliferative, developmental, epithelial, neurological, 
and reproductive disorders. In the treatment of disorders associated with increased PMOD expression 
or activity, it is deskable to decrease the expression or activity of PMOD. In the treatment of 
disorders associated with decreased PMOD expression or activity, it is desirable to increase the 
expression or activity of PMOD. 

Therefore, in one embodiment, PMOD or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PMOD. Examples of such disorders include, but are not limited to, a gastrointestinal 
disorder, such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal 
carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, 
gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colcmic 
obstruction, hritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemocliromatosis, Wilson's disease, alpha^- 
antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, hver infarction, portal vein 
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; a 
cardiovascular disorder, such as arteriovenous fistula, atlierosclerosis, hypertension, vasculitis, 
Raynaud's disease, aneurysms, arterial dissections, varicose vems, thrombophlebitis and 
phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular 
replacement, and coronary artery bypass graft surgery, congestive heart failure, ischemic heart 
disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular 
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heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular 
calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective 
endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, 
carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, 
5 congenital heart disease, and complications of cardiac transplantation; an autoimmune/inflammatory 
disease, such as acquired immunodeficiency syndrome (An)S), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anenma, asthma, atherosclerosis, 
atherosclerotic plaque rupture, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 

10 dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, degradation of articular cartilage, osteoporosis, 

15 pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sj5gren's 
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic 
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
trauma; a cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 

20 cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukenma, lyn^homa, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 

25 penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a developmental 
disorder, such as renal tubular acidosis, anemia, Gushing' s syndrome, achondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, bone resorption, epilepsy, gonadal dysgenesis, WAGR 
syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation). Smith- 
Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary 

30 keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, 
spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, age-related macular 
degeneration, and sensorineural hearing loss; an epithelial disorder, such as dyshidrotic eczema, 
allergic contact dermatitis, keratosis pilaris, melasma, vitiligo, actinic keratosis, basal cell carcinoma, 

35 squamous cell carcinoma, seborrheic keratosis, folliculitis, herpes simplex, herpes zoster, varicella, 
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candidiasis, dermatophytosis, scabies, insect bites, cherry angioma, keloid, dermatofibroma, 
acrochordons, urticaria, transient acantholytic dermatosis, xerosis, eczema, atopic dermatitis, contact 
dermatitis, hand eczema, nummular eczema, lichen simplex chronicus, asteatotic eczema, stasis 
dermatitis and stasis ulceration, seborrheic dermatitis, psoriasis, lichen planus, pityriasis rosea, 
5 impetigo, ecthyma, dermatophytosis, tinea versicolor, warts, acne vulgaris, acne rosacea, penq>higus 
vulgaris, pemphigus foliaceus, paraneoplastic pemphigus, bullous pemphigoid, herpes gestationis, 
dermatitis herpetiformis, linear IgA disease, epidennolysis bullosa acquisita, demoatomyositis, lupus 
erythematosus, scleroderma and morphea, erythroderma, alopecia, figurate skin lesions, 
telangiectasias, hypopigmentation, hyperpigmentation, vesicles/buUae, exanthems, cutaneous drug 

10 reactions, papulonodular skin lesions, chronic non-healing wounds, photosensitivity diseases, 

epidermolysis bullosa simplex, epidermolytic hyperkeratosis, epidermolytic and nonepidermolytic 
pahnoplantar keratoderma, ichthyosis bullosa of Siemens, ichthyosis exfoliativa, keratosis palmaris et 
plantaris, keratosis pahnoplantaris, pahnoplantar keratoderma, keratosis punctata, Meesmann's 
comeal dystrophy, pachyonychia congenita, white sponge nevus, steatocystoma multiplex, epidermal 

15 nevi/epidermolytic hyperkeratosis type, monilethrix, trichothiodystrophy, chronic 

hepatitis/cryptogenic cirrhosis, and colorectal hyperplasia; a neurological disorder, such as epilepsy, 
ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick*s disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 

20 pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinatmg diseases, bacterial and 
viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuru, Creutzfeldt Jakob disease, and Gerstmann-Straussler-Scheinber syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 

25 tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system including Down 
syndrome, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve 
disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral 
nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and 

30 toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, 
and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, catatonia, 
diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, 
Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; and a reproductive disorder, such as infertility, including tubal disease, 

35 ovulatory defects, and endometriosis, a disorder of prolactin production, a disruption of the estrous 
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cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation 
syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoinmiune disorders, an ectopic 
pregnancy, and teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a 
disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the 
prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the 
male breast, and gynecomastia. 

In another embodiment, a vector capable of expressing PMOD or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of PMOD including, but not limited to, those described above. 

Li a further embodiment, a composition comprising a substantially purified PMOD in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of PMOD including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of PMOD may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PMOD including, but not limited to, those listed above. 

In a further embodiment, an antagonist of PMOD may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of PMOD. Examples of such 
disorders include, but are not limited to, those gastrointestinal, cardiovascular, 
autoimmune/inflammatory, cell proliferative, developmental, epithelial, neurological, and 
reproductive disorders described above. In one aspect, an antibody which specifically binds PMOD 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissues which express PMOD. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding PMOD may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of PMOD including, but not limited to, those described above. 

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, 
or vector embodiments may be administered in combination with other appropriate therapeutic 
agents. Selection of the appropriate agents for use in combination therapy may be made by one of 
ordinary skill in the art, according to conventional pharmaceutical principles. The combination of 
therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of PMOD may be produced using methods which are generally known in the 
art. In particular, purified PMOD may be used to produce antibodies or to screen libraries of 



63 



wo 03/000844 



PCT/US02/19360 



pharmaceutical agents to identify those which specifically bind PMOD, Antibodies to PMOD may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
5 dimer formation) are generally preferred for therapeutic use. Single chain antibodies (e.g., from 
camels or llamas) may be potent enzyme inhibitors and may have advantages in the design of peptide 
mimetics, and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. 
Biotechnol. 74:277-302). 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels, 

10 dromedaries, llamas, humans, and others may be inomunized by injection with PMOD or with any 
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, 
various adjuvants may be used to increase immunological response. Such adjuvants include, but are 
not limited to, Freund's, ndneral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, pclyanions, peptides, oil emulsions, KLH, and dinitrophenoL 

15 Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvwn are 
especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
PMOlD have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 

20 fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
of PMOD amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to PMOD may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 

25 limited to, the hybridoma technique, the hunlan B-cell hybridoma technique, and the EBV-hybridon[ia 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 

30 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce PMOD-specific single 

35 chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 

64 



wo 03/000844 



PCT/US02/19360 



generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g.. 
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin Ubraries or panels of highly specific binding reagents as 
5 disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Nad. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for PMOD may also be generated. 
For example, such fragments include, but are not limited to, F(ab\ fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 

10 the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246: 1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradioihetric assays usmg either 

15 polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between PMOD and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering PMOD epitopes is generally used, but a conq)etitive binding assay 
may also be employed (Pound, supra). 

20 Various noethods such as Scatchard analysis in conjunction with radioimmunoassay 

techniques may be used to assess the affinity of antibodies for PMOD, Affinity is expressed as an 
association constant, K^, which is defined as the molar concentration of PMOD-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their 

25 affinities for multiple PMOD epitopes, represents the average affinity, or avidity, of the antibodies for 
PMOD. The determined for a preparation of monoclonal antibodies, which are monospecific for a 
particular PMOD epitope, represents a true measure of affinity. High-affinity antibody preparations 
with Ka ranging from about 10^ to 10^^ L/mole are preferred for use in immunoassays in which the 
PMOD-antibody complex must withstand rigorous manipulations. Low-affinity antibody 

30 preparations with ranging from about 10^ to 10^ L/mole are preferred for use in 

immunopurification and similar procedures which ultimately require dissociation of PMOD, 
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
Approach , IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to 
Monoclonal Antibodies , John Wiley & Sons, New York NY). 

35 The titer and avidity of polyclonal antibody preparations may be further evaluated to 
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determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of PMOD-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 

5 guidelines for antibody quality and usage in various applications, are generally available. (See, e.g., 
Catty, supra, and Coligan et al. supra,) 

' In another embodiment of the invention, polynucleotides encoding PMOD, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 

10 RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
PMOD. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
PMOD. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., Totawa NJ.) 
In therapeutic use, any gene delivery system suitable for introduction of the antisense 

15 sequences into appropriate target cells can be used. Antisense sequences can be delivered 

intracellularly in the form of an expression plasnaid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.. 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13): 1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 

20 vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g.. Miller, A.D. (1990) Blood 
76:271; Ausubel, supra\ Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217'-225; Boado, RJ. et 
al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 

25 25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding PMOD may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 

30 immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
CBlaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.a et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor Vin or Factor DC deficiencies (Crystal, 

35 R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
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express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
(1988) Nature 335:395-396; Poeschla, E. et al (1996) Proc. Natl, Acad. Sci, USA 93:11395-11399), 
5 hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis\ and protozoan parasites such as Plasmodium falciparum and Trypanosoma cnizi)- In the 
case where a genetic deficiency in PMOD expression or regulation causes disease, the expression of 
PMOD from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

10 In a further embodiment of the invention, diseases or disorders caused by deficiencies in 

PMOD are treated by constructing mammalian expression vectors encoding PMOD and introducing 
these vectors by mechanical means into PMOD-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 

15 transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
0pm. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of PMOD include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-T0P0TA vectors 

20 (Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), 
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). PMOD . 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), 
Rous sarcoma vims (RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an inducible 
promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. 

25 Acad, Sci. USA 89:5547-5551; Gossen. M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and 
H.M. Blau (1998) Curr. Opin. Biotechnol, 9:451-456), commercially available in the T-REX plasmid 
(Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; 
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter 
(Rossi, F.M.V. and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of 

30 the endogenous gene encoding PMOD from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 

35 (Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
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(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to PMOD expression are treated by constructing a retrovirus vector consisting of (i) the 

5 polynucleotide encoding PMOD under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RKE) along with additional retrovirus cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
conmiercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 

10 Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. MiUer (1988) J. ViroL 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 

15 et al. (1998) J. ViroL 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaming 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 

20 the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. ViroL 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. ViroL 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

In an embodiment, an adenovirus-based gene therapy delivery system is used to deliver 
25 polynucleotides encoding PMOD to cells which have one or more genetic abnormalities with respect 
to the expression of PMOD. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
30 described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) 
Annu. Rev. Nutr. 19:511-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another embodiment, a herpes-based, gene therapy delivery system is used to deliver 
35 polynucleotides encoding PMOD to target cells which have one or more genetic abnormalities with 
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respect to the expression of PMOD. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing PMOD to cells of the central nervous system, for which HSV has 
a tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
5 been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 which 
consists of a genome containing at least one exogenous gene to be transferred to a cell under the 

10 control of the appropriate promoter for purposes including human gene therapy. Also taught by this 
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.R et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) 
Dev. Biol. 163: 152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of recombinant virus foUowinjg the transfection of multiple plasmids 

15 containing different segments of the large herpesvirus genomes, the growth and propagation of 
herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordmary skill in the art. 

In another embodiment, an alphavkus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding PMOD to target cells. The biology of the prototypic alphavirus, 

20 Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. 0pm. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the oveiproduction of capsid proteins relative to the viral proteins with enzymatic activity 

25 (e.g., protease and polymerase). Sinndlarly, inserting the coding sequence for PMOD into the 

alphavirus genome in place of the capsid-coding region results in the production of a large number of 
PMOD-coding RNAs and the synthesis of high levels of PMOD in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 

30 indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of PMOD into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 

35 transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
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art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double hehx to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. 
and B J. Carr, Molecular a nd Tmmnn ologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163- 
177.) A complementary sequence or antisense molecule may also be designed to block translation of 
mRNA by preventing the transcript from binding to ribosoxnes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of RNA molecules encoding PMOD. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes may be prepared by any method 
known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, 
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding 
PMOD, Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' 0-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine. 
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cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding PMOD. 
5 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 

10 promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
PMOD expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding PMOD nnay be therapeutically useful, and in the treatment of disorders 
associated with decreased PMOD expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding PMOD may be therapeutically useful. 

15 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known m the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturallyK)ccurring or non-natural chemical compounds; rational design of a compound 

20 based on chemical and/or structural properties of the target polynucleotide; and selection firom a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding PMOD is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding PMOD are assayed 

25 by any method conmionly known in the art. Typically, the expression of a specific nucleotide is 

detected by hybridization with a probe having a nucleotide sequence con^iplementary to the sequence 
of the polynucleotide encoding PMOD. The amount of hybridization may be quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 

30 exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, usmg a Schizosaccharomyces pombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Amdt, G.M. et al. (2000) Nucleic Acids 
Res. 28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 

35 Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
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combinatorial library of oligonucleotides (such as deoxy ribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

5 Many methods for introducing vectors into cells or tissues are available and equally suitable 

for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are.well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
10 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition 

15 which generally comprises an active ingredient formulated with a pharmaceutically acceptable 
excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remingto n's Phannaceutical Sciences (Maack Publishing, Easton PA). Such compositions may 
consist of PMOD, antibodies to PMOD, and mimetics, agonists, antagonists, or inhibitors of PMOD. 

20 The compositions utilized in this invention may be administered by any number of routes 

including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

25 These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides 
and proteins), recent developments in the field of pulmonary delivery via the alveolar region of the 
lung have enabled the practical delivery of drugs such as insulin to blood ch-culation (see, e.g., Patton, 

30 J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration 
without needle injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use m the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art. 

35 Specialized forms of compositions may be prepared for direct intracellular delivery of 
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macromolecules comprising PMOD or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, PMOD or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
5 transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 

10 range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
PMOD or fragments thereof, antibodies of PMOD, and agonists, antagonists or inhibitors of PMOD, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined 

15 by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDgt/EDso ratio. Compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 

20 used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of admmistration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 

25 subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 

30 or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 ^tg to 100,000 /.ig, up to a total dose of 
* about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 

35 inhibitors. Sinailarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
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conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind PMOD nnay be used for the 
diagnosis of disorders characterized by expression of PMOD, or in assays to monitor patients being 

5 treated with PMOD or agonists, antagonists, or inhibitors of PMOD. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for PMOD include methods which utilize the antibody and a label to detect PMOD in human body 
fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 

10 reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring PMOD, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of PMOD expression. Normal 
or standard values for PMOD expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to PMOD 

15 under conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of PMOD expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, polynucleotides encoding PMOD may be used for 

20 diagnostic purposes. The polynucleotides which may be used include oligonucleotides, 

complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of PMOD may be correlated 
with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of PMOD, and to monitor regulation of PMOD levels during therapeutic intervention. 

25 In one aspect, hybridization with PGR probes which are capable of detecting polynucleotides, 

including genomic sequences, encoding PMOD or closely related molecules may be used to identify 
nucleic acid sequences which encode PMOD. The specificity of the probe, whether it is made from a 
highly specific region, e.g., the 5'regulatory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will determine whether the probe 

30 identifies only naturally occurring sequences encoding PMOD, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the PMOD encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:29-56 or from 
genomic sequences including promoters, enhancers, and introns of the PMOD gene. 

35 Means for producing specific hybridization probes for polynucleotides encoding PMOD 
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include the cloning of polynucleotides encoding PMOD or PMOD derivatives into vectors for the 
production of niElNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes m vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
5 variety of reporter groups, for example, by radionuclides such as ^^P or ^^S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotides encoding PMOD may be used for the diagnosis of disorders associated with 
expression of PMOD. Examples of such disorders include, but are not limited to, a gastrointestinal 
disorder, such as dysphagia, peptic esophagitis, esophageal spasnoi, esophageal stricture, esophageal 

10 carcinontia, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, 
gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoufia, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 

15 Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic 

obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alphas 
antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver mfarction, portal vein 

20 obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of ^ 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; a 
cardiovascular disorder, such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, 
Raynaud's disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and 

25 pHlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular 
replacement, and coronary artery bypass graft surgery, congestive heart failure, ischemic heart 
disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular 
heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular 
calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective 

30 endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, 
carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, 
congenital heart disease, and complications of cardiac transplantation; an autoimmune/inflammatory 
disease, such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 

35 atherosclerotic plaque rupture, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
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polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 

5 thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 

myocardial or pericardial mflanamation, osteoarthritis, degradation of articular cartilage, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sj5gren's 
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic 
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 

10 extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
traimia; a cell proliferative diisorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 

15 particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 

bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis,.prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a developmental 
disorder, such as renal tubular acidosis, anemia, Gushing' s syndrome, achondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, bone resorption, epilepsy, gonadal dysgenesis, WAGR 

20 syndrome (Wilms* tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith- 
Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, 
spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, age-related macular 

25 degeneration, and sensorineural hearing loss; an epithelial disorder, such as dyshidrotic eczema, 

allergic contact dermatitis, keratosis pilaris, melasma, vitiligo, actinic keratosis, basal cell carcinoma, 
squamous cell carcinoma, seborrheic keratosis, folliculitis, herpes simplex, herpes zoster, varicella, 
candidiasis, dermatophytosis, scabies, insect bites, cherry angioma, keloid, dermatofibroma, 
acrochordons, urticaria, transient acantholytic dermatosis, xerosis, eczema, atopic dermatitis, contact 

30 dermatitis, hand eczema, nummular eczema, lichen simplex chronicus, asteatotic eczema, stasis 
dermatitis and stasis ulceration, seborrheic dermatitis, psoriasis, lichen planus, pityriasis rosea, 
impetigo, ecthynna, dermatophytosis, tinea versicolor, warts, acne vulgaris, acne rosacea, pemphigus 
vulgaris, pemphigus foliaceus, paraneoplastic pemphigus, bullous pemphigoid, herpes gestationis, 
dermatitis herpetiformis, linear IgA disease, epidermolysis bullosa acquisita, dermatomyositis, lupus 

35 erythemiatosus, scleroderma and morphea, erythroderma, alopecia, figurate skin lesions, 
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telangiectasias, hypopigmentation, hyperpigmentation, vesicles/buUae, exanthems, cutaneous drug 
reactions, papulonodular skin lesions, chronic non-healing wounds, photosensitivity diseases, 
epidermolysis bullosa simplex, epidermolytic hyperkeratosis, epidermolytic and nonepidermolytic 
palmoplantar keratoderma, ichthyosis bullosa of Siemens, ichthyosis exfoliativa, keratosis palmaris et 
5 plantaris, keratosis palmoplantaris, palmoplantar keratoderma, keratosis punctata, Meesmann' s 

comeal dystrophy, pachyonychia congenita, white sponge nevus, steatocystoma multiplex, epidermal 
nevi/epidermolytic hyperkeratosis type, monilethrix, trichothiodystrophy, chronic 
hepatitis/cryptogenic cirrhosis, and colorectal hyperplasia; a neurological disorder, such as epilepsy, 
ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease. Pick's disease, 

10 Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and 
viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 

15 including kuni, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system including Down 
syndrome, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve 

20 disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral 
nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and 
toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, 
and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, catatonia, 
diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, 

25 Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 

frontotemporal dementia; and a reproductive disorder, such as infertility, including tubal disease, 
ovulatory defects, and endometriosis, a disorder of prolactin production, a disruption of the estrous 
cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation 
syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic 

30 pregnancy, and teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a 
disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the 
prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the 
male breast, and gynecomastia. Polynucleotides encoding PMOD may be used in Southern or 
northern analysis, dot blot, or other membrane-based technologies; in PGR technologies; in dipstick, 

35 pin, and multifonnat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to 
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detect altered PMOD expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, polynucleotides encoding PMOD may be used in assays that detect the 
presence of associated disorders, particularly those mentioned above. Polynucleotides 
complementary to sequences encoding PMOD may be labeled by standard methods and added to a 
5 fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantified and 
compared with a standard value. If the amount of signal in the patient sample is significantly altered 
in comparison to a control sample then the presence of altered levels of polynucleotides encoding 
PMOD in the sample indicates the presence of the associated disorder. Such assays may also be used 

10 to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 
trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
PMOD, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 

15 sequence, or a fragment thereof, encoding PMOD, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 

20 values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 

25 days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 

30 to employ preventative measures or aggressive treatment earlier, thereby preventing die development 
or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
PMOD may involve the use of PGR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide 

35 encoding PMOD, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
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PMOD, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from polynucleotides encoding PMOD 
5 may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions 
and deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods 
of SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) 
and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from 
polynucleotides encoding PMOD are used to amplify DNA using the polymerase chain reaction 

10 (PGR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, 
bodily fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary. 
- structures of PGR products in single-stranded form, and these differences are detectable using gel 
electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are fluorescently 
labeled, which allows detection of the amplimers in high-throughput equipment such as DNA 

15 sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 
overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatogranis. In the 

20 alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the 
high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

SNPs may be used to study the genetic basis of human disease. For example, at least 16 
common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also 
useful for examining differences in disease outcomes in monogenic disorders, such as cystic fibrosis, 

25 sickle cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding 
lectin, MBL2, have been shown to be correlated with deleterious pulmonary outcomes in cystic 
fibrosis. SNPs also have utihty in pharmacogenomics, the identification of genetic variants that 
influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in 
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the 

30 anti-tuberculosis drug isoniazid, while a variation in the core promoter of the ALOX5 gene results in 
diminished clinical response to treatment with an anti-asthma drug that targets the 5-lipoxygenase 
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating 
genetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations 
and their migrations. (Taylor, J.G. et al. (2001) Trends Mol. Med. 7:507-512; Kwok, P.-Y. and Z. Gu 

35 (1999) Mol. Med. Today 5:538-543; Nowotny, P. et al. (2001) Curr. Opin. Neurobiol. 1 1:637-641.) 
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Methods which may also be used to quantify the expression of PMOD include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. hnmunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
5 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

Lfi further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotides described herein may be used as elements on a microarray. The microarray can be 

10 used in transcript imaging techniques which monitor the relative expression levels of large numbers 
of genes simultaneously as described below. The microarray may also be used to identify genetic 
variants, mutations, and polymorphisms. This information may be used to determine gene function, 
to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 

15 activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and 
effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

20 In another embodunent, PMOD, fragments of PMOD, or antibodies specific for PMOD may 

be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 

25 gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance imder given conditions and at 
a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 

30 transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 

hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, 

35 biopsies, or other biological samples. The transcript image may thus reflect gene expression in vivo, • 
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as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
5 compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 1 12-1 13:467-471). If a test compound has a signature similar to that of a 
compound with known toxicity, it is likely to share those toxic properties. These fingerprints or 

10 signatures are most useful and refined when they contain expression information from a large number 
of genes and gene families. Ideally, a genome-wide measurement of expression provides the highest 
quality signature. Even genes whose expression is not altered by any tested conq)ounds are important 
as well, as the levels of expression of these genes are used to normalize the rest of the expression 
data. The normalization procedure is useful for comparison of expression data after treatment with 

15 different compounds. While the assignment of gene function to elements of a toxicant signature aids 
in interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the 
statistical matching of signatures which leads to prediction of toxicity. (See, for example, Press 
Release 00-02 from the National Institute of Environmental Health Sciences, released February 29, 
2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and 

20 desirable in toxicological screening using toxicant signatures to include all expressed gene sequences. 

In an embodiment, the toxicity of a test compound can be assessed by treating a biological - 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 

•25 invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another embodiment relates to the use of the polypeptides disclosed herein to analyze the 
proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression 

30 in a particular tissue or cell type. Each protem component of a proteome can be subjected 
mdividually to further analysis. Proteome expression patterns, or profiles, are analyzed by 
quantifying the number of expressed proteins and their relative abundance under given conditions and 
at a given time. A profile of a cell's proteome may thus be generated by separating and analyzing the 
polypeptides of a particular tissue or cell type. In one embodiment, the separation is achieved using 

35 two-dimensional gel electrophoresis, in which proteins from a sample are separated by isoelectric 
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focusing in the first dimension, and then according to molecular weight by sodium dodecyl sulfate 
slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot 
5 is generally proportional to the level of the protein in the sample. The optical densities of 

equivalently positioned protein spots from different samples, for example, from biological saii5)les 
either treated or untreated with a test conapound or therapeutic agent, are compared to identify any 
changes in protein spot density related to the treatment. The proteins in the spots are partially 
sequenced using, for example, standard methods employing chemical or enzymatic cleavage followed 

10 by mass spectrometry. The identity of the protein in a spot may be determined by comparing its 

partial sequence, preferably of at least 5 contiguous amino acid residues, to the polypeptide sequences 
of interest. In some cases, further sequence data may be obtamed for definitive protem identification. 

A proteomic profile may also be generated using antibodies specific for PMOD to quantify 
the levels of PMOD expression. In one embodiment, the antibodies are used as elements on a 

15 microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 
270:103-111; Mendoze, L.G. et al, (1999) Biotechniques 27:778-788). Detection may be performed 
by a variety of methods known in the art, for example, by reacting the proteins in the sample with a 
thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at 

20 each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 

25 useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to 
rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in such 
cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
30 sample containing proteins with the test compound. Proteins that are expressed in the treated 

biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test cori:q)ound in the treated san^le. Individual proteins are identified by sequencing 
35 the amino acid residues of the individual proteins and comparing these partial sequences to the 
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polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
5 protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 
sample. 

Microairays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

10 Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are 
well known and thoroughly described in DNA Microarravs: A Practical Approach . M. Schena^ ed. 

15 (1999) Oxford University Press, London. 

In another embodiment of the invention, nucleic acid sequences encoding PMOD may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 

20 members of a multi-gene family may potentially cause undesired cross hybridization during 

chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a ciiromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. 

25 et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7: 127-134; and Trask, BJ. 

(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences may be used to develop 
genetic linkage maps, for example, which correlate the inheritance of a disease state with the 
inheritance of a particular chromosome region or restriction fragment length polymorphism (RFLP). 
(See, for example. Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

30 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 

map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic 
map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding PMOD on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 

35 region of DNA associated with that disorder and thus may further positional cloning efforts. 



83 



wo 03/000844 



PCT/US02/19360 



In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
5 valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to llq22-23, 
any sequences mapping to that area nfiay represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Natxure 336:577-580.) The nucleotide sequence of 

10 the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, PMOD, its catalytic or inraiunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 

15 solid support, borne on a cell surface, or located intracellularly . The formation of binding complexes 
between PMOD and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 

20 synthesized on a solid substrate. The test conq)ounds are reacted with PMOD, or fragments thereof, 
and washed. Bound PMOD is then detected by methods well known in the art. Purified PMOD can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

25 In another embodiment, one may use competitive drug screening assays in which neutralizing 

antibodies capable of binding PMOD specifically compete with a test compound for bindmg PMOD. 

In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 

antigenic determinants with PMOD. 

In additional embodiments, the nucleotide sequences which encode PMOD may be used in 
30 any molecular biology techniques that have yet to be developed, provided the new techniques rely on 

properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utilize the present mvention to its fullest extent. The following preferred specific 
35 embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
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The disclosures of all patents, applications, and publications mentioned above and below, 
including U.S. Ser. No. 60/300,508, U.S. Ser. No. 60/303,445, U.S. Ser. No. 60/305,405, U.S. Ser. 
No. 60/311,442, U.S. Ser. No. 60/314,821, U.S. Ser. No. 60/315,992, and U.S. Ser. No. 60/378/205, 
5 are hereby expressly incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD database 

10 (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinium 
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of 
• denaturants, such as TRIZOL (Invitrogen), a monophasic solution of phenol and guanidine 
isothiocyanate. The resulting lysates were centrifiiged over CsCl cushions or extracted with 
chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 

15 ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 

20 isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNEAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the feconmiended 

25 procedures or sinoilar methods known in the art: (See, e.g., Ausubel, 1997, supra, units 5.1-6.6.) 
Reverse transcription was mitiated' using oligo d(T) or random primers. Synthetic oligonucleotide 
adapters were ligated to double stranded cDNA, and the cDNA was digested with the appropriate 
restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using 
SEPHACRYL SIOOO, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography 

30 (Amersham Biosciences) or preparative agarose gel electrophoresis. cDNAs were ligated into 
compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT 
plasmid (Stratagene), PSPORTl plasmid (Invitrogen), PCDNA2.1 plasmid (Invitrogen, Carlsbad 
CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA plasmid (Invitrogen), PCMV-ICIS plasmid 
(Stratagene), pIGEN (Incyte Genomics, Palo Alto CA), pRARE (Incyte Genomics), or pDSfCY (Incyte 

35 Genomics), or derivatives thereof. Recombinant plasmids were transformed into competent E, coli 
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cells including XLl-BIue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DHIOB, or 
ElectroMAX DHIOB from Invitrogen. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 

5 excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E. A.L. PREP 96 
plasmid purification kit from QIAGEN. FoUowing.precipitation, plasmids were resuspended in 0.1 

10 nal of distilled water and stored, with or without lyophilization, at 4°C 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in a 
high-throughput formiat (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thennal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 

15 using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN n fluorescence 
scanner (Labsystems Oy, Helsinki, Finland), 
in. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example n were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 

20 such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MI Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the 
ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

25 Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Amersham Biosciences); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 

30 1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the 
techniques disclosed in Example VHI. 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 

35 Incyte cDNA sequences or translations thereof were then queried against a selection of public 
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databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo sapiens, 
Rattiis norvegicus, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, and Candida albicans (Incyte Genomics, Palo Alto CA); hidden 
5 Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM (Haft, 
D.H. et al. (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as 
SMART (Schultz et al. (1998) Proc. Natl. Acad. Sci. USA 95:5857-5864; Letunic, L et al. (2002) 
Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus 
primary structures of gene families. See, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 

10 6:361-365.) The queries were performed using programs based on BLAST, FASTA, BLIMPS, and 
HMMER. The Incyte cDNA sequences were assembled to produce fiill length polynucleotide 
sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, 
or Genscan-predicted coding sequences (see Examples IV and V) were used to extend Incyte cDNA 
assemblages to full length. Assembly was performed using programs based on Phred, Phrap, and 

15 Consed, and cDNA assemblages were screened for open reading frames using programs based on 
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive 
the corresponding full length polypeptide sequences. Alternatively, a polypeptide may begin at any 
of the methionine residues of the fiill length translated polypeptide. Full length polypeptide 
sequences were subsequently analyzed by querying against databases such as the GenBank protein 

20 databases (genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, 
Prosite, hidden Markov model (HMM)-based protein family databases such as PFAM, INCY, and 
TIGRFAM; and HMM-based protein domain databases such as SMART. Full length polynucleotide 
sequences are also analyzed using MACDNASIS PRO software (Hitachi Software Engineering, 
South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide 

25 sequence alignments are generated using default parameters specified by the CLUSTAL algorithm as 
incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which also 
calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and 

30 threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 
references, all of which are incorporated by reference herein in then- entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 

35 the greater the identity between two sequences). 
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The programs described above for the assembly and analysis of full length polynucleotide 
and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:29-56. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 2. 
5 IV. Identiflcation and Editing of Coding Sequences from Genomic DNA 

Putative protein modification and maintenance molecules were initially identified by running 
the Genscan gene identification program against public genomic sequence databases (e.g., gbpri and 
gbhtg). Genscan is a general-purpose gene identification program which analyzes genomic DNA 
sequences from a variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, 

10 and Burge, C. and S. Karlm (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates 
predicted exons to form an assembled cDNA sequence extending from a methionine to a stop codon. 
The output of Genscan is a FASTA database of polynucleotide and polypeptide sequences. The 
maximum range of sequence for Genscan to analyze at once was set to 30 kb. To determine which of 
these Genscan predicted cDNA sequences encode protein modification and maintenance molecules, 

15 the encoded polypeptides were analyzed by querying against PFAM models for protein modification 
and maintenance molecules. Potential protein modification and maintenance molecules were also 
identified by homology to hicyte cDNA sequences tiiat had been annotated as protein modification 
and mamtenance molecules. These selected Genscan-predicted sequences were then compared by 
BLAST analysis to tht genpept and gbpri public databases. Where necessary, the Genscan-predicted 

20 sequences were then edited by comparison to the top BLAST hit from genpept to correct errors in the 
sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis was also used to 
find any Incyte cDNA or public cDNA coverage of the Genscan-predicted sequences, thus providmg 
evidence for transcription. When Incyte cDNA coverage was available, this information was used to 
correct or confirm the Genscan predicted sequence. Full length polynucleotide sequences were 

25 obtained by assembling Genscan-predicted coding sequences with Incyte cDNA sequences and/or 
pubUc cDNA sequences using the assembly process described in Example HI. Altematively, full 
length polynucleotide sequences were derived entirely from edited or unedited Genscan-predicted 
coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 

30 "Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 
in were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 

35 based on graph theory and dynamic programming to integrate cDNA and genomic information. 
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generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequence. Sequence intervals in which the entire length of the interval was present on 
more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
5 sequences, then all three intervals were considered to be equivalent. This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then "stitched" together by the stitching algorithm in the order that they appear 
along their parent sequences to generate the longest possible sequence, as well as sequence variants. 
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 

10 genomic sequence to genonaic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit from genpept. Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 

15 ^'Stretched'' Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example HI were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 

20 analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Inserdons or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog; the chimeric protein, or both were used as probes to search for 

25 homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 
resultant stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of PMOD Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:29-56 were compared with 

30 sequences fi-om the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:29-56 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 

35 Genome Research (WIGR), and G6n6thon were used to determine if any of the clustered sequences 
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had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 

of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 

position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
5 arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 

chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 

humans, although this can vary widely due to hot and cold spots of recombination.) The cM 

distances are based on genetic markers mapped by G6nethon which provide boundaries for radiation 

hybrid markers whose sequences were included in each of the clusters. Human genome maps and 
10 other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 

(http://www.ncbi.nlm.mh.gov/genemap/), can be employed to determine if previously identified 

disease genes map within or in proximity to the intervals indicated above. 

VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
15 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 

from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel 

(1995) supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 

molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
20 much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 

computer search can be modified to determine whether any particular match is categorized as exact or 

similar. The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
25 5 X minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 

30 product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 

35 BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
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entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap, 
5 Alternatively, polynucleotides encoding PMOD are analyzed with respect to the tissue 

sources from which they were derived. For exan^)le, some full length sequences are assembled, at 
least in part, with overlapping Incyte cDNA sequences (see Example HI). Each cDNA sequence is 
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into 
one of the following organ/tissue categories: cardiovascular system; connective tissue; digestive 

10 system; embryonic structures; endocrine system; exocrine glands; genitaHa, female; genitalia, male; 
germ cells; hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; 
respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. 
The number of libraries in each category is counted and divided by the total number of libraries 
across all categories. Similarly, each human tissue is classified into one of the following 

15 disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma, 
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided 
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and 
disease-specific expression of cDNA encoding PMOD. cDNA sequences and cDNA library/tissue 
information are found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA). 

20 VIIL Extension of PMOD Encoding Polynucleotides 

Full length polynucleotides are produced by extension of an appropriate fragment of the full . 
length molecule using oligonucleotide primers designed from this fragment. One primer was 
synthesized to initiate 5' exteinsion of the known fragment, and the other primer was synthesized to 
initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 4.06 

25 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about SS^'C to about 72*'C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 

30 extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the art. PGR 
was performed in 96-well plates using the PTG-20D thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg^"^, (NH4)2S04, 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme 

35 (Invitrogen), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair 
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PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60X, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 X, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94 "^C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57X, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
5 Step 6: 68 °C, 5 niin; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 fi\ PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved m IX TE 
and 0.5 [il of undiluted PCR product into each well of an opaque fluorimeter plate (Coming Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan n 
10 (Labsystems Oy, Helsmki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 ;Ul to 10 //I aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extendmg the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 

15 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun 
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
reUgated using T4 ligase (New England Biolabs, Beverly MA) mto pUC 18 vector (Amersham 

20 Biosciences), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and 
transfected into competent E. coli cells. Transformed cells were selected on antibiotic-contaming 
media, and individual colonies were picked and cultured overnight at 37 "^C in 384-well plates in - 
LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR usmg Taq DNA polymerase 

25 (Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 
1: 94^C, 3 min; Step 2: 94^0, 15 sec; Step 3: 60^C, 1 min; Step 4: 72"C, 2 min; Step 5: steps 2, 3, 
and 4 repeated 29 times; Step 6: 72 °C, 5 min; Step 7: storage at 4°C. DNA was quantified by 
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries 
were reamplified using the same conditions as described above. Samples were diluted with 20% 

30 dimethy sulfoxide (1 :2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers 
and the DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE Terminator' 
cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotides are verified using the above procedure or are used 
to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed for 

35 such extension, and an appropriate genomic library. 
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EX. Identification of Single Nucleotide Polymorphisms in PMOD Encoding Polynucleotides 

Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) were 
identified in SEQ ID NO:29-56 using the LIFESEQ database (Incyte Genomics). Sequences from the 
same gene were clustered together and assembled as described in Example HI, allowing the 
5 identification of all sequence variants in the gene. An algorithm consisting of a series of filters was 
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of 
basecall errors by requiring a minimum Phred quality score of 15, and removed sequence alignment 
errors and errors resulting from improper trimming of vector sequences, chimeras, and splice 
variants. An automated procedure of advanced chromosome analysis analysed the original 

10 chromatogram files in the vicinity of the putative SNP. Clone error filters used statistically generated 
algorithntis to identify errors introduced during laboratory processing, such as those caused by reverse 
transcriptase, polymerase, or somatic mutation. Clustering error filters used statistically generated 
algorithms to identify errors resulting from clustering of close homologs or pseudogenes, or due to 
contamination by non-human sequences. A final set of filters removed duplicates and SNPs found in 

15 immunoglobulins or T-cell receptors. 

Certain SNPs were selected for further characterization by mass spectrometry using the high 
throughput MASSARRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites in 
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46 
female), mcluding 83 from Utah, four French, three Venezualan, and two Amish individuals. The 

20 African population comprised 194 individuals (97 male, 97 female), all African Americans. The 
Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The 
Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown 
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. Allele 
frequencies were first analyzed in the Caucasian population; in some cases those SNPs which showed 

25 no allelic variance in this population were not fiulher tested in the other three populations. 
X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:29-56 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 

30 fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /^Ci of 
[y-^^P] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont NEN, 
Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 
superfine size exclusion dextran bead colunm (Amersham Biosciences). An aliquot containing 10^ 

35 counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of 
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human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, 
Xba I, or Pvu H (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
5 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an altemative imaging means and 
compared. 

XI, Microarrays 

10 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra.\ 
mechanical mdcrpspotting technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 

15 Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chenrical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al, (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 

20 Marshall, A. and J. Hodgson (1998) Nat Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or firagments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known m the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 

25 '' biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 

30 on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium tliiocyanate method and 
35 poly(A)* RNA is purified using the oligo-(dT) cellulose method. Each poly (A) RNA sample is 
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reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/^l oligo-(dT) primer (21mer), IX 
first strand buffer, 0.03 units/jLtl RNase inhibitor, 500 (M dATP, 500 piM dGTP, 500 jliM dTTP, 40 
liM dCTP, 40 fiM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Biosciences). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A)* RNA v^th 
5 GBMBRIGHT kits (Incyte). Specific control poly(A)^ RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85** C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin colunms (CLONTECH Laboratories, Inc. 
10 (CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
ihen dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 jLtl 5X SSC/0.2% SDS. 
Microarrav Preparation 

15 Sequences of the present invention are used to generate array elements. Each array element 

is amplified from bacterial cells containing vectors widi cloned cDNA inserts. PCR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
jLtg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences). 

20 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water ' 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 

25 llO^Coven. 

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 jxl of the array element DNA, at an average 
concentration of 100 ng//xl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 
30 Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 
0.2% SDS and distilled water as before. 

35 
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Hybridization 

Hybridization reactions contain 9 /xl of sample nnixture consisting of 0.2 fig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered 
5 with an 1.8 cm^ coverslip. The eirrays are transferred to a waterproof chamber having a cavity just 
slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 /xl of 5X SSC in a comer of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60**C The arrays are washed for 10 min at 45° C in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45*'C in a second wash buffer (O.IX 
10 SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Qaia CA) capable of generating spectral lines 
at 488 mn for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 

15 focused on the array usmg a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used m the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

20 Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 hm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 

25 although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1: 100,000. When two samples 

30 firom different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 
two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

35 (A/D) conversion board (Analog Devices, Inc;, Norwood MA) installed in an IBM-compatible PC 
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computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
5 enaission spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
10 Array elements that exhibited at least about a two-fold change in expression, a signal-to-background 
ratio of at least 2.5, and an element spot size of at least 40% were identified as differentially 
expressed using the GEMTOOLS program (Incyte Genomics). 
Expression 

The human C3 A cell line is a clonal derivative of HepG2/C3 (hepatoma cell line, isolated 

15 from a IS-yearold male with liver tumor), which was selected for strong contact inhibition of growth. 
The use of a clonal population enhances the reproducibility of the cells. C3A cells have many 
characteristics of primary human hepatocytes in culture: i) expression of insulin receptor and insulin- 
like growth factor n receptor; ii) secretion of a high ratio of serum albumin compared witli a- 
fetoprotein; iii) conversion of ammonia to urea and glutamine; iv) metabolism of aromatic amino 

20 acids; and v) proliferation in glucose-free and insulm-free medium. The C3A cell line is now well 
established as an in vitro model of the mature human liver (Mickelson et al. (1995) Hepatology 
22:866-875; Nagendra et al. (1997) Am. J. Physiol. 272:G408-G416). The expression of SEQ ID 
NO:29 was altered by a factor of 2 or more in cells treated with a variety of steroids including 
prednisone, dexamethasone, medroxyprogesterone, budesonide, and beclomthasone. In addition, the 

25 expression of SEQ ID NO:29 was was altered by a factor of two or more in C3A cells. Therefore, 
SEQ ID NO:29 can be used in assays related to treatment for cell proliferative disorders. 

For example, SEQ ID N0:31 and SEQ ID NO:34 showed differential expression in breast 
tumor cell lines versus normal breast epithelial cells as determined by microarray analysis. The 
expression of SEQ ID NO:31 and SEQ ID NO;34 was decreased by at least two fold in breast tumor 

30 cell lines that were harvested from donors with both early and late stages of tumor progression and 
malignant transformation. Therefore, SEQ ID N0:31 and SEQ ID NO:34 can be used in diagnostic . 
assays for breast cancer. 

In another example, SEQ ID NO:33 showed differential expression in response to several 
compounds which produce known metabolic and toxicological responses. The expression of SEQ ID 

35 NO: 33 was reduced by at least two fold in the human C3A liver cell line incubated for varying 
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lengths of time with compounds including fenofibrate, clofibrate, dexamethasone, beclomethasone, 
medroxyprogesterone, budesonide, and betamethasone. Therefore, SEQ ID NO: 33 can be used in 
toxicology testing. 

SEQ ID NO:35 showed differential expression in human peripheral blood mononuclear cells 
5 (PBMCs) following exposure to 5 and 25 fxM prednisone for 24 hours. Prednisone is a corticosteroid 
that is metabolized in the liver to its active form, prednisolone. Prednisone is approximately four 
times more potent as a glucocorticoid than hydrocortisone. Glucocorticoids are naturally occurring 
hormones that prevent or suppress inflammation and immune responses when administered at 
pharmacologic doses. At the molecular level, unbound glucocorticoids readily cross cell membranes 
10 and bind with high affinity to specific cytoplasmic receptors. Subsequent to bindmg, transcription 
^nd, ultimately, protein synthesis are affected. The result can include inhibition of leukpcyte 
infiltration at the site of inflammation, interference in the function of mediators of the mflaimnatory 
response, and suppression of humoral immune responses. PBMCs can be classified into discrete 
cellular populations representing the major cellular components of the unmune system. PBMCs 

15 contain about 52% lymphocytes (12% B lymphocytes, 40% T lymphocytes {25% CD4+ and 15% 
CD8+}), 20% NK cells, 25% monocytes, and 3% various cells that include dendritic cells and 
progenitor cells. These cells were pooled from the blood of 6 healtiiy volunteer donors. The 
expression of SEQ ID NO:35 was decreased by at least two-fold in prednisone-treated (5 and 25 (jM) 
' cells as compared to untreated controls. 

20 SEQ ID NO:44 showed differential expression in normal tissue versus tissue affected by 

prostate carcinoma by microarray analysis. Expression of SEQ ID NO:44 in a primary prostate 
epithelial cell line (PrEC) isolated from a normal donor was compared to expression of SEQ ID 
NO:44 in a prostate carcinoma cell line isolated (LNCaP) from a lymph node biopsy of a 50-year-old 
male witii metastatic prostate carcinoma. Expression of SEQ ID NO;44 was decreased by at least 

25 two-fold in the cell line affected by prostate carcinoma. In addition, expression of SEQ ID NO:44 in 
peripheral blood mononuclear cells isolated from a pool of healthy donors was decreased by at least 
two-fold by treatment with 1 ng/ml IL12 for 24hours. The expression of SEQ ID NO:44 was also 
shown to be differentially expressed in a comparison of breast cells lines by microarray analysis. In 
four of the seven breast cancer cell lines tested, expression of SEQ ID NO:44 was shown to be 

30 decreased by at least two-fold when compared to noimal mammary epithelial cells (HMEC), 

indicating the use of SEQ ID NO:44 as a diagnostic marker, for disease staging, and as a therapeutic 
target for protease-associated diseases including prostate and breast cancer. 

For example, SEQ ID NO:51 showed differential expression in toxicology studies as 
determined by microarray analysis. The expression of SEQ ID NO:51 was decreased by at least two 

35 fold in a human C3A hver cell line treated with various drugs (e.g., steroids, steroid hormones) 

98 



wo 03/000844 



PCT/US02/19360 



relative to untreated C3A cells. The human C3A cell line is a clonal derivative of HepG2/C3 
(hepatoma cell line, isolated from a 15-year-old male v^ith liver tumor), which was selected for strong 
contact inhibition of growth. The C3 A cell line is well established as an in vitro model of the mature 
human liver (Mickelson et al. (1995) Hepatology 22:866-875; Nagendra et al. (1997) Am J Physiol 
5 272:G408-G416). Effects upon liver metaboUsm are important to understanding the 

pharmacodynamics of a drug. Therefore, SEQ ID NO: 51 can be used for understanding the 
pharmacodynamics of a drug. 

In another example, SEQ ID NO:55 showed differential expression in lung adenocarcinoma 
versus normal lung tissues as determined by microarray analysis. The expression of SEQ ID NO:55 

10 was decreased by at least two fold in lung adenocarcinoma relative to grossly uninvolved normal lung 
tissue from the same donor. Therefore, SEQ ID NO:55 can be used as a diagnostic marker for disease 
staging or as a potential therapeutic target for lung cancer. 

As another example, SEQ ID NO:56 is downregulated in breast cancer cell lines versus 
nonmalignant mammary epithelial cells, as determined by UMcroarray analysis. In one experiment, 

15 gene expression profiles of nonmalignant mammary epithelial cells were compared to gene 

expression profiles of various breast carcinoma lines at different stages of tumor progression. The 
cells were grown in defined serum-free H14 medium to 70-80% confluence prior to RNA harvest. 
Cell lines compared included: a) HMEC, a primary breast epithelial cell line isolated from a normal 
donor, b)MCF-10A, a breast mammary gland cell line isolated from a 36-year-old woman with 

20 fibrocystic breast disease, c)MCF7, a nonmalignant breast adenocarcinoma cell line isolated from the 
pleural effusion of a 69- year-old female, d)T-47D, a breast carcinoma ceU line isolated from a 
pleural effusion obtained from a 54-year-old female with an infiltrating ductal carcinoma of the 
breast, e)Sk-BR-3, a breast adenocarcinoma cell line isolated from a malignant pleural effusion of a 
43-year-old female, f)BT-20, a breast carcinoma cell line derived in vitro from cells emigrating out of 

25 thin slices of the tumor mass isolated from a 74-year-old female, g)MDA-mb-231, a breast tumor cell 
line isolated from the pleural effusion of a 51-year-old female, and h)MDA-mb-435S, a spindle- 
shaped strain that evolved from the parent line (435) isolated by R. Cailleau from pleural effusion of a 
31-year-old female with metastatic, ductal adenocarcinoma of the breast. Therefore, SEQ ID NO:56 
can be used in monitoring treatment of, and diagnostic assays for, breast cancer. 

30 As another example, SEQ ID NO: 56 is downregulated in prostate carcinomas versus primary 

prostate epithelial cells, as determined by microarray analysis. Primary prostate epithelial cells were 
compared with prostate carcinomas representative of the different stages of tumor progression. Cell 
lines compared included: a) PrEC, a primary prostate epithelial cell line isolated from a normal donor, 
b) DU 145, a prostate carcinoma cell line isolated from a metastatic site in the brain of 69-year old 

35 male with widespread metastatic prostate carcinoma, c) LNCaP, a prostate carcinoma cell line 
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isolated from a lymph node biopsy of a 50-year-old male with metastatic prostate carcinoma, and 
d)PC-3, a prostate adenocarcinoma cell line isolated from a metastatic site in the bone of a 62-year- 
old male with grade IV prostate adenocarcinoma. In one experiment, cells were grown in basal media 
in the absence of growth factors and hormones. In a second experiment, cells were grown under 
5 optimal growth conditions, in the presence of growth factors and nutrients. Cells grown under 

restrictive conditions were compared to normal PrECs grown under restrictive conditions. Therefore, 
SEQ ID NO: 56 can be used in monitoring treatment of, and diagnostic assays for, prostate cancer. 
^11. Complementary Polynucleotides 

Sequences complementary to the PMOD-encoding sequences, or any parts thereof, are used 

10 to detect, decrease, or inhibit expression of naturally occurring PMOD. Although use of 
oligonucleotides con^rising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of PMOD. To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence 

15 and used to prevent promoter binding to the coding sequence. To inhibit translation, a 

complementary oligonucleotide is designed to prevent ribosomal binding to the PMOD-encoding 
transcript. 

Xin. Expression of PMOD 

Expression and purification of PMOD is achieved using bacterial or virus-based expression 

20 systems. For expression of PMOD in bacteria, cDNA is subcloned into an appropriate vector 

containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL2'1(DE3). 

25 Antibiotic resistant bacteria express PMOD upon induction with isopropyl beta-D- 

thiogalactc^yranoside (IPTG). Expression of PMOD in eukaryotic cells is achieved by mfecting 
insect or mammalian cell lines with recombinant Autographica califomica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encodmg PMOD by either homologous recombination or bacterial-mediated 

30 transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodopter a frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 

35 7:1937-1945.) 
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In most expression systems, PMOD is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, smgle-step, 
affinity-based purification of recombinant fusion protein from crude cell ly sates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on 
5 immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Biosciences). Following purification, the GST moiety can be proteolytically cleaved from PMOD at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffmity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 

10 (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra, 
ch. 10 and 16). Purified PMOD obtained by these methods can be used directly in the assays shown 
in Examples XVII, XVHI, and XIX, where applicable. 
XIV. Functional Assays 

PMOD function is assessed by expressing the sequences encoding PMOD at physiologically 

15 elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include PCMV SPORT plasmid (Invitrogen. Carlsbad CA) and PCR3.1 plasmid (fovitrogen), both of 
which contain the cytomegalovirus promoter. 5-10 fxg of recombinant vector are transiently 
transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either 

20 liposome formulations or electroporation, 1-2 ^ig of an additional plasmid containing sequences 
encoding a marker protein are co-transfected. Expression of a marker protein provides a means to r 
distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression 
from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein 
(GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser 

25 optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to 
evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the 
uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These 
events include changes in nuclear DNA content as measured by staining of DNA with propidium 
iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side 

30 light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine 
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity 
with specific antibodies; and alterations in plasma membrane composition as measured by the binding 
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
. discussed in Ormerod, M.G. (1994) Flow Cytometry . Oxford, New York NY. 
. 35 The influence of PMOD on gene expression can be assessed using highly purified 
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populations of cells transfeeted with sequences encoding PMOD and either CD64 or CD64-GFP. 
CD64 andCD64-GFP are expressed on the surface of transfeeted cells and bind to conserved regions 
of human inununoglobulin G (IgG). Transfeeted cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
5 Success NY). mRNA can be piuified from.the cells using methods weD known by those of sldU m 
the art. Expression of mRNA encoding PMOD and other genes of interest can be analyzed by 
northem analysis or microarray techniques. 
XV. Production of PMOD Specific Antibodies 

PMOD substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
10 Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize animals (e.g., rabbits, inice, etc.) and to produce antibodies using standard protocols. 

Altematively, tlie PMOD amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
15 selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) • 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
. Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
20 increase immunogenicity. (See, e.g., Ausubel, 1995, supra,) Rabbits are immunized with the 
oligopeptide-KLH conq)lex in complete Freund's adjuvant. Resulting antisera are tested for 
antipeptide and anti-PMOD activity by, for example, bindmg the peptide or PMOD to a substrate, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat 
anti-rabbit IgG. 

25 XVL Purification of Naturally Occurring PMOD Using Specific Antibodies 

Naturally occurring or recombinant PMOD is substantially purified by immunoaffinity 
chromatography using antibodies specific for PMOD. An immunoaffinity column is constructed by 
covalently coupling anti-PMOD antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and 

30 washed according to the manufacturer's instructions. 

Media containing PMOD are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of PMOD (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/PMOD binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 

35 as urea or thiocyanate ion), and PMOD is collected. 
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XVIL Identification of Molecules Which Interact with PMOD 

PMOD, or biologically active fragments thereof, are labeled with ^^I Bolton-Hunter reagent. 
(See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the v^ells of a multi-well plate are incubated with the labeled PMOD, washed, 
5 and any wells with labeled PMOD complex are assayed. Data obtained using different concentrations 
of PMOD are used to calculate values for the number, affinity, and association of PMOD with the 
candidate molecules. 

Alternatively, molecules interacting with PMOD are analyzed using the yeast two-hybrid 
system as described in Fields, S, and O. Song (1989) Nature 340:245-246, or using conunercially 
10 available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

PMOD may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput nnanner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 
15 XVIL Demonstration of PMOD Activity 

PMOD activity can be demonstrated using a generic immunoblotting strategy or through a 
variety of specific activity assays, some of which are outlined below. As a general approach, cell 
lines or tissues transformed with a vector containing PMOD coding sequences can be assayed for 
PMOD activity by immunoblotting. Transformed cells are denatured in SDS in the presence of (3- 
20 mercaptoethanol, nucleic acids are removed by ethanol precipitation, and proteins are purified by 
acetone precipitation. Pellets are resuspended in 20 mM Tris buffer at pH 7.5 and incubated with 
Protein G-Sepharose pre-coated with an antibody specific for PMOD. After washing, the Sepharose 
beads are boiled in electrophoresis sample buffer, and the eluted proteins subjected to SDS-PAGE. 
The SDS-PAGE is transferred to a membrane for inmiunoblotting, and the PMOD activity is assessed 
25 by visuahzing and quantifying bands on the blot using the antibody specific for PMOD as the primary 
- antibody and ^^^I-labeled IgG specific for the primary antibody as the secondary antibody. 

PMOD kinase activity is measured by quantifying the phosphorylation of a protein substrate 
by PMOD in the presence of gamma-labeled ^^P-ATP. PMOD is incubated with tiie protein substrate, 
^^P-ATP, and an appropriate kinase buffer. The ^^P incorporated into the substrate is separated from 
30 free ^^P-ATP by electrophoresis and the incorporated ^"P is counted using a radioisotope counter. The 
amount of incorporated ^^P is proportional to the activity of PMOD. A determination of the specific 
amino acid residue phosphorylated is made by phosphoamino acid analysis of the hydrolyzed protein, 

PMOD phosphatase activity is measured by the hydrolysis of P-nitrophenyl phosphate 
(PNPP). PMOD is incubated together with PNPP in HEPES buffer pH 7.5, m the presence of 0.1% p- 
35 mercaptoethanol at 37 ''C for 60 min. The reaction is stopped by the addition of 6 ml of 10 N NaOH ' 
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and the increase in light absorbance at 410 nm resulting from the hydrolysis of PNPP is measured 
using a spectrophotometer. The increase in light absorbance is proportional to the activity of PMOD 
in the assay (Diamond, R.H. et al. (1994) MoL Cell. Biol. 14:3752-62). 

The assay for SEQ ID NO: 10 is carried out as described above for PMOD using a cysteine 
5 protease, such as papain, assayed in the absence and in the presence of various concentrations of SEQ 
ID NO: 10. Inhibition of papain protease activity is proportional to the activity of SEQ ID NO: 10 in 
the assay. 

The assay for SEQ ID NO: 1 1 is carried out as described above for PMOD using matrix 
metalloproteinase assayed in the absence and in the presence of various concentrations of SEQ ID 
10 NO: 1 1 . Inhibition of matrix metalloproteinase activity is proportional to the activity of SEQ ID 
N0:11' in the assay. 

In the alternative, PMOD phosphatase activity is determined by measuring the amount of 
phosphate removed from a phosphorylated protein substrate. Reactions are performed with 2 or 4 nM 
enzyme in a final volimae of 30 fil containing 60 mM Tris, pH 7.6, 1 mM EDTA, 1 mM EGTA, 0. 1% 

15 2-mercaptoethanol and 10 [jtM substrate, ^^P-labeled on serine/threonine or tyrosine, as appropriate. 
Reactions are initiated with substrate and incubated at 30° C for 10-15 min. Reactions are quenched 
with 450 fil of 4% (w/v) activated charcoal in 0.6 M HCl, 90 mM Na4P207, and 2 mM NaH2P04, then 
centrifuged at 12,000 x g for 5 min. Acid-soluble ^^Pi is quantified by liquid scintillation counting 
(Sinclair, C. et al. (1999) J. Biol. Chem. 274:23666-23672). 

20 PMOD protease activity is measured by the hydrolysis of appropriate synthetic peptide 

substrates conjugated with various chromogenic molecules in which the degree of hydrolysis is 
quantified by spectrophotometric (or fluorometric) absorption of the released chromophore (Beynon, 
R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach. Oxford University Press, New 
York, NY, pp.25~55). Peptide substrates are designed according to the category of protease activity 

25 as endopeptidase (serine, cysteine, aspartic proteases, or metalloproteases), aminopeptidase (leucine 
aminopeptidase), or carboxy peptidase (carboxy peptidases A and B, procollagen C-proteinase). 
Commonly used chromogens are 2-naphthylamine, 4-nitroaniline, and furylacrylic acid. Assays are 
performed at ambient temperature and contain an aliquot of the enzyme and the appropriate substrate 
in a suitable buffer. Reactions are carried out in an optical cuvette, and the increase/decrease in 

30 absorbance of the chromogen released during hydrolysis of the peptide substrate is measured. The 
change in absorbance is proportional to the enzyme activity in the assay. 

In the alternative, an assay for PMOD protease activity takes advantage of fluorescence 
resonance energy transfer (FRET) that occurs when one donor and one acceptor fluorophore with an 
appropriate spectral overlap are in close proximity. A flexible peptide linker containing a cleavage 

35 site specific for PMOD is fused between a red-shifted variant (RSGFP4) and a blue variant (BFP5) of 
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Green Fluorescent Protein, This fusion protein has spectral properties that suggest energy transfer is 
occurring from BFP5 to RSGFP4. When the fusion protein is incubated with PMOD, the substrate is 
cleaved, and the two fluorescent proteins dissociate. This is accompanied by a marked decrease in 
energy transfer which is quantified by comparing the emission spectra before and after the addition of 
5 PMOD (Mitra, R.D. et al (1996) Gene 173: 13-17). This assay can also be performed in living cells. 
In this case the fluorescent substrate protein is expressed constitutively in cells and PMOD is 
introduced on an inducible vector so that FRET can be monitored in the presence and absence of 
PMOD (Sagot, 1. et al (1999) FEES Letters 447:53-57). 

An assay for ubiquitin hydrolase activity measures the hydrolysis of a ubiquitin precursor. 

10 The assay is performed at ambient temperature and contauis an aliquot of PMOD and the appropriate 
substrate in a suitable buffer. Chemically synthesized human ubiquitin-valine may be used as 
substrate. Cleavage of the C-terminal valine residue from the substrate is monitored by capillary 
electrophoresis (Franklin, K. et al. (1997) Anal. Bioehem. 247:305-309). 

PMOD protease inhibitor activity for alpha 2-HS-glycoprotein (AHSG) can be measured as a 

15 decrease in osteogenic activity in dexamethasone-treated rat bone marrow cell cultures (dex-RBMC). 
Assays are carried out in 96-well culture plates containing minimal essential medium supplemented 
with 15% fetal bovine serum, ascorbic acid (50 M-g/ml), antibiotics (100 M-g/ml penicillin G, 50 fxg/ml 
gentamicin, 0.3 jig/ml fungizone), 10 mM B -glycerophosphate, dexamethasone (10'^ M) and various 
concentrations of PMOD for 12-14 days. Mineralized tissue formation in the cultures is quantified 

20 by measuring the absorbance at 525 nm using a 96-well plate reader (Binkert, C. et al. supra ). 

PMOD protease inhibitor activity for inter-alpha-trypsin inhibitor (ITI) can be measured by a 
continuous spectrophotometric rate determination of trypsin activity. The assay is performed at 
ambient temperature in a quartz cuvette in pH 7.6 assay buffer containing 63 mM sodium phosphate, 
0.23 mM N a-benzoyle-L-arginine ethyl ester, 0.06 mM hydrochloric acid, 100 units trypsin, and 

25 various concentrations of PMOD. Immediately after mixing by inversion, the increase in A 253 nm is 
recorded for approximately 5 minutes and the enzyme activity is calculated (Bergmeyer, H.U. et al. 
(1974) Meth. Enzym. Anal. 1:515-516) 

PMOD isomerase activity such as peptidyl prolyl cis/trans isomerase activity can be assayed 
by an enzyme assay described by Rahfeld, lU., et al. (1994) (FEES Lett. 352: 180-184). The assay 

30 is performed at 10 °C in 35 mM HEPES buffer, pH 7.8, containing chymotrypsin (0.5 mg/ml) and 
PMOD at a variety of concentrations. Under these assay conditions, the substrate, Suc-Ala-Xaa-Pro- 
Phe-4-NA, is in equilibrium with respect to the prolyl bond, with 80-95% in trans and 5-20% in cis 
conformation. An aliquot (2 ul) of the substrate dissolved in dimethyl sulfoxide (10 mg/ml) is added 
to the reaction mixture described above. Only the cw isomer of the substrate is a substrate for 

35 cleavage by chymotrypsin. Thus, as the substrate is isomerized by PMOD, the product is cleaved by 
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chymotrypsin to produce 4-nitroanilide, which is detected by it' s absorbance at 390 nm. 4- 
Nitroanilide appears in a time-dependent and a PMOD concentration-dependent manner. 

PMOD galactosyltransferase activity can be determined by measuring the transfer of 
radiolabeled galactose from UDP-galactose to a GlcNAc-terminated oligosaccharide chain 
5 (Kolbinger, F. et al. (1998) J. Biol. Chem. 273:58-65). The sample is incubated with 14 jil of assay 
stock solution (180 mM sodium cacodylate, pH 6.5, 1 mg/ml bovine serum albumin, 0.26 mM UDP- 
galactose, 2 /xl of UDP-PH] galactose), 1 jul of MnCl, (500 mM), and 2.5 ill of GlcNAcpO-(CH,)g- 
COjMe (37 mg/ml m dimethyl sulfoxide) for 60 minutes at 37 °C. The reaction is quenched by the 
addition of 1 ml of water and loaded on a CIS Sep-Pak cartridge (Waters), and the column is washed 

10 twice with 5 ml of water to remove unreacted UDP-[TTlgalactose. The ^HJgalactosylated 

GlcNAcP0-(CH2)8-C02Me remains bound to the column during the water washes and is eluted with 5 
ml of methanol. Radioactivity in the eluted material is measured by liquid scintillation counting and 
is proportional to galactosyltransferase activity in the starting sample. 

PMOD induction by heat or toxins may be demonstrated using primary cultures of human 

15 fibroblasts or human cell lines such as CCL-13, HEK293, or HEP 02 (ATCC). To heat mduce 

PMOD expression, aliquots of cells are incubated at 42 **C for 15, 30, or 60 mmutes. Control aliquots 
are incubated at 37 °C for the same tune periods. To induce PMOD expression by toxins, aliquots of 
cells are treated with 100 fiM arsenite or 20 mM azetidine-2-carboxylic acid for 0, 3, 6, or 12 hours. 
After exposure to heat, arsenite, or the amino acid analogue, samples of the treated cells are harvested 

20 and cell ly sates prepared for analysis by westem blot. Cells are lysed in lysis buffer containing 1% 
Nonidet P-40, 0.15 M NaCl, 50 mM Tris-HCl, 5 mM EDTA, 2 mM N-ethyhnaleimide, 2 mM 
phenylmethylsulfonyl fluoride, 1 mg/ml leupeptin, and 1 mg/ml pepstatin. Twenty micrograms of 
the cell lysate is separated on an 8% SDS-PAGE gel and transferred to a membrane. After blocking 
• with 5% nonfat dry milk/phosphate-buffered saline for 1 h, the membrane is incubated overnight at 

25 4°C or at room temperature for 2-4 hours with an appropriate dilution of anti-PMOD serum in 2% 
nonfat dry milk/phosphate-buffered saline. The membrane is then washed and incubated with a 
1:1000 dilution of horseradish peroxidase-conjugated goat anti-rabbit IgG in 2% dry 
milk/phosphate-buflFered saline. After washing with 0.1% Tween 20 in phosphate-buffered saline, 
the PMOD protein is detected and compared to controls using chemiluminescence. 

30 PMOD lysyl hydroxylase activity is determined by measuring the production of 

hydroxyE^'^CJlysine from [^"^CJlysine. Radiolabeled protocoUagen is incubated with PMOD in buffer 
containing ascorbic acid, iron sulfate, dithiothreitol, bovine serum albuimn, and catalase. Following a 
30 minute incubation, the reaction is stopped by the addition of acetone, and centrifuged. The 
sedunented material is dried, and the hydroxy[^'^C]lysine is converted to [^"^Qlformaldehyde by 

35 oxidation with periodate, and then extracted into toluene. The amount of ^*C extracted into toluene is 
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quantified by scintillation counting, and is proportional to the activity of PMOD in the sample 
(Kivuikko, K., and Myllyla, R. (1982) Methods Enzymol. 82:245-304). 
XVIIL Identification of PMOD Substrates 

Phage display libraries can be used to identify optimal substrate sequences for PMOD. A 
random hexamer followed by a linker and a known antibody epitope is cloned as an N-terminal 
extension of gene HI in a filamentous phage library. Gene HI codes for a coat protein, and the epitope 
will be displayed on the surface of each phage particle. The library is incubated with PMOD under 
proteolytic conditions so that the epitope will be removed if the hexamer codes for a PMOD cleavage 
site. An antibody that recognizes the epitope is added along with immobilized protein A. Uncleaved 
phage, which still bear the epitope, are removed by centrifugation. Phage in the supernatant are then 
amplified and undergo several more rounds of screening. Individual phage clones are then isolated 
and sequenced. Reaction kinetics for these peptide substrates can be studied using an assay in 
Example XVn, and an optimal cleavage sequence can be derived (Ke, S.H. et al. (1997) J. Biol. 
Chem. 272:16603-16609). 

To screen for in vivo PMOD substrates, this method can be expanded to screen a cDNA 
expression library displayed on the surface of phage particles (T7SELECT™10-3 Phage display 
vector, Novagen, Madison, WI) or yeast cells (pYDl yeast display vector kit, Invitrogen, Carlsbad, 
CA). In this case, entire cDNAs are fused between Gene HI and the appropriate epitope. 
XIX. Identification of PMOD Inhibitors 

Compounds to be tested are arrayed in the wells of a multi-well plate in varying 
concentrations along with an appropriate buffer and substrate, as described in the assays in Example 
XVn. PMOD activity is measured for each well and the ability of each compound to inhibit PMOD 
activity can be determined, as well as the dose-response kinetics. This assay could also.be used to 
identify molecules which enhance PMOD activity. 

In the alternative, phage display Ubraries can be used to screen for peptide PMOD inhibitors. 
Candidates are found among peptides which bind tighdy to a protease. In this case, multi-well plate 
wells are coated with PMOD and incubated with a random peptide phage display library or a cyclic 
peptide library (Koivunen, E. et al. (1999) Nature Biotech 17:768-774). Unbound phage are washed 
away and selected phage amplified and rescreened for several more rounds. Candidates are tested for 
PMOD inhibitory activity using an assay described in Example XVn. 

Various modifications and variations of the described compositions, methods, and systems of 
the invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. It will be appreciated that the invention provides novel and useful proteins, and their 
encoding polynucleotides, which can be used in the drug discovery process, as well as methods for 
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using these compositions for the detection, diagnosis, and treatment of diseases and conditions. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Nor should the description of such embodiments be considered exhaustive or limit the invention to 
5 the precise forms disclosed. Furthermore, elements from one embodiment can be readily recombined 
with elements from one or more other embodiments. Such combinations can form a nimiber of 
embodiments within the scope of the invention. It is intended that the scope of the invention be 
defined by the following claims and their equivalents. 
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Table 1 



Incyte Project ID 


Polypeptide 


Incyte 


Polynucleotide 


Incyte 






SEQID 


Polypeptide ID 


SEQID NO: 


Polynucleotide 


Incyte Full 




NO: 






ID 


Length Clones 


7994355 


1 


7994355CD1 


29 


7994355CB1 




7475875 


2 


7475875CD1 


30 


7475875CB1 




71231882 


3 


71231882CD1 


31 


71231882CB1 




2875922 


4 


2875922CD1 


32 


2875922CB1 


90129891CA2 


8158136 


5 


8158136CD1 


33 


8158136CB1 


90D38744CA2 


5969491 


6 


5969491CD1 


34 


596949 ICBl 




7497367 


7 


7497367CD1 


35 


7497367CB1 


90092215CA2 


7632424 


8 


7632424CD1 


36 


7632424CB1 




1804436 


9 


1804436CD1 


37 


1804436CB1 




7486358 


10 


7486358CD1 


38 


7486358CB1 




7472344 


11 


7472344CD1 


39 


7472344CB1 




7192959 


12 


7192959CD1 


40 


7192959CB1 




6169565 


13 


6169565CD1 


41 


6169565CB1 


6169565CA2 


7494717 


14 


7494717CD1 


42 


7494717CB1 




7497510 


15 


74975 lOCDl 


43 


74975 lOCBl 


90166158CA2, 












90166182CA2, 












90166190CA2, 












90166282CA2, 












90166290CA2, 












90189432CA2, 












90189440CA2, 












90189464CA2, 












90189480CA2, 












901895 16CA2, 












901S9532CA2, 












90189548CA2, 












90189596CA2, 












90189833CA2, 












90189849CA2, 












90189857CA2, 












90189865CA2, 












9018988 1CA2, 












90189933CA2, 












90189941CA2, 












90189949CA2, 












90189957CA2. 












90189981CA2, 












90189989CA2, 












901901 89CA2 


7498882 


16 


7498882CD1 


44 


7498882CB1 




5524205 


17 


5524205CD1 


45 


5524205CB1 




7102342 


18 


7102342CD1 


46 


7102342CB1 




4169939 


19 . 


4169939CD1 


47 


4169939CBI 
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Table 1 



Incyte Project ID 


Polypeptide 


Incyte 


Polynucleotide 


Incyte 






SEQID 


Polypeptide ID 


SEQID NO: 


Polynucleotide 


Incyte Full 




NO: 






ID 


Length Clones 


6539977 


20 


6539977CD1 


48 


6539977CB1 


90188738CA2, 
90188893CA2, 
95003737CA2, 
95003761CA2, 
95003853CA2, 
95003869CA2, 
95003905CA2, 
95003913CA2, 
95003969CA2, 
95004005CA2, 
95004061CA2, 
95004069CA2 


7675588 


21 


7675588CD1 


49 


7675588CB1 


4213559CA2, 
90166613CA2 


6244077 


22 


6244077CD1 


50 


6244077CB1 


90110106CA2, 
90110114CA2, 
90110154CA2, 
90110162CA2, 
90110170CA2, 
90110186CA2, 
90110194CA2, 
90110270CA2, 
90110278CA2, 
90110286CA2, 
90110294CA2, 
90110470CA2 


7498404 


23 


7498404CD1 


51 


7498404CB1 




7391748 . 


24 


7391748CD1 


52 


7391748CB1 
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53 
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26 


7499881CD1 


54 


7499881CB1 




7488579 


27 


7488579CD1 


55 


7488579CB1 




7510521 


28 


7510521CD1 


56 


7510521CB1 
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cn 



Analytical Methods 
and Databases 


HMMER^PFAM 1 


HMMER_^PFAM | 


BLIMPS_PFAM . 1 


CO 

m 


BLAST_PRODOM 


O 

o 

CO 
< 


MOTIFS 

i 


MOTIFS 1 


TMAP 1 


BLIMPS.BLOCKS | 


u 

1 


MOTIFS 


MOTIFS 1 


MOTIFS 


MOTIFS 


p: 


. 


HMMER_PFAM 


BLIMPS_BLOCKS 


BLAST_PRODOM 


BLAST_DOMO 


Signature Sequences, Domains and Motifs 


HECT-domain (ubiquitin-transferase): D467-A774 


Ank repeat: V26-I58, S59-K92, N93-II25 


Ank repeat proteins; PF00023: L31-L46, G60-R69 


iHECT-domain (ubiquitin-transferase): PF00632: V586-G592, W680-P707, L735-C766 1 


PROTEIN LIGASE UBIQUITIN CONJUGATION REPEAT UBIQUITIN D20PROTEIN DNA 
BINDING PROBABLE ONCOGENIC PD002225: P492-Y771 


HECT DOMAIN DM01690; P39940|5 13-808: N475-L764; P51593|9-306: N475-Y771; 
A38919|785-1082: N475-A774; P531 191615-909: N475-S768 


Potential Phosphorylation Sites: S7 S15 S159 S176 S231 S250 S291 S338 S430 S669 S703 T96 
T128T216 T689 Y397 


Potential Glycosylation Sites: N283 N722 


Transmembrane domain: V113-A129, A141-V169 N-terminus is non-cytosolic 


Tyrosine specific protein phosphatase BL00383: 1137-G147, R170-C185 


Tyrosine specific protein phosphatases signature and profiles: L116-P174 


Tyrosine specific protein phosphatases active site: I137-L149 


Leucine zipper pattern: L333-L354 1 


Potential Phosphorylation Sites: D18S21 S201 S341 S343 S448 S486 S496 S503 S594 T118 
T272 T493 T605 T679 T687 T695 


Potential Glycosylation Sites: N550 


Ubiquitin carboxyl-terminal hydrolases family 1: S41-L72 


Ubiquitin carboxyl-terminal hydrolase family 2: L285-K482 


Ubiquitin carboxyl-terminal hydrolases family 2 proteins BL00972: G42-L59, M129-N138, 1158- 
C172, 1288-D3I2, C446-Q467 


PROTEASE UBIQUITIN HYDROLASE UBIQUITINSPECIFIC ENZYME 
DEUBIQUITINATING CARBOXYLTERMINAL TfflOLESTERASE PROCESSING 
CONJUGATION 


UBIQUITIN CARBOXYL-TERMINAL HYDROLASES FAMILY 2; DM00659|P50101|209- 
458: N45-G299 


Amino Acid 
Residues 


















cn 
o 














1256 










Incyte 

Polypeptide 

ID 


7994355CDI 
















7475875CD1 














71231882CD1 
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Analytical Methods 
and Databases 


HMMER_PFAM 1 


HMMER.PFAM 1 


BLIMPS_PFAM 1 


BLIMPS PFAM 


BLAST_.PRODOM 


BLAST^DOMO 


MOTIFS 


Imotifs 1 


TMAP 


CO 

a 
o 

hJ 

PQ 

\ 

CO 

hJ 


PROFILESCAN 


MOTIFS 1 


MOTIFS 


MOTIFS 


MOTIFS 


HMMER PFAM 1 


1 


HMMER PFAM 


BLIMPS_BLOCKS 


BLAST_PRODOM 


BLAST_DOMO 


Signature Sequences, Domains and Motifs 


HECT-domain (ubiquitin-transferase): D467-A774 


Ank repeat: V26-I58, S59-K92, N93-I125 


Ank repeat proteins; PF00023: L3 1-L46, G60-R69 


HECT-domain (ubiquitin-transferase): PF00632: V586-G592, W680-P707, L735-C766 


PROTEIN LIGASE UBIQUITIN CONJUGATION REPEAT UBIQUITIN D20PROTEIN DNA 
BINDING PROBABLE ONCOGENIC PD002225: P492-Y771 


HECT DOMAIN DM01690; P39940|5 13-808: N475-L764; P51593|9-306: N475-Y771; 
A38919I785-1082: N475-A774; P53119|615-909: N475-S768 


Potential Phosphorylation Sites: S7 S15 S159 S176 S231 S250 S291 S338 S430 S669 S703 T96 
T128T216T689 Y397 


Potential Glycosylation Sites: N283 N722 


Transmembrane domain: V113-A129, A141-V169 N-terminus is non-cytosolic 


Tyrosine specific protein phosphatase BL00383: 1137-G147, R170-C185 


Tyrosine specific protein phosphatases signature and profiles: L116-P174 


Tyrosine specific protein phosphatases active site: I137-L149 


Leucine zipper pattern: L333-L354 


Potential Phosphorylation Sites: D18S21 S201 S341 S343 S448 S486 S496 S503 S594 T118 
T272 T493 T605 T679 T687 T695 


Potential Glycosylation Sites: N550 


Ubiquitin carboxyl-terminal hydrolases family 1: S41-L72 


Ubiquitin carboxyl-terminal hydrolase family 2: L285-K482 


Ubiquitin carboxyl-terminal hydrolases family 2 proteins BL00972: G42-L59, M129-N138, 1158- 
C172, 1288-D312, C446-Q467 


PROTEASE UBIQUITIN HYDROLASE UBIQUITINSPECMC ENZYME 
DEUBIQUITINATING CARBOXYLTERMINAL TfflOLESTERASE PROCESSING 
CONJUGATION 


UBIQUITIN CARBOXYL-TERMINAL HYDROLASES FAMH -Y 2; DM00659|P50101|209- 
458: N45-G299 


Amino Acid 
Residues 


r- 
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Incyte 

Polypeptide 

ID 


I7994355CD1 
















7475875CD1 














71231882CD1 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-28, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1, SEQ ID NO:3-7, SEQ ID NO:9-19, SEQ ID NO:21-26, and SEQ ID NO:28, 

c) a polypeptide comprising a naturally occurring amino acid sequence at least 94% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:2andSEQIDNO:20. 

d) a polypeptide comprising a naturally occurring amino acid sequence at least 96% 
identical to an amino acid sequence of SEQ ID NO: 8, 

e) a polypeptide comprising a naturally occurring amino acid sequence at least 97% 
identical to an amino acid sequence of SEQ ID NO:27, 

f) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28, and 

g) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-28. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-28. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:29-56. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 
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8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 



9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
5 said cell is transformed with a recombinant polynucleotide, and said recombinant 

polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

10 10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-28. 

1 1. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprismg a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:29-56, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected frona the group consisting of 
SEQ ID NO:29-53 and SEQ ID NO:55-56, 

c) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
91 % identical to a polynucleotide sequence of SEQ ID NO:54, 

d) a polynucleotide complementary to a polynucleotide of a), 

e) a polynucleotide complementary to a polynucleotide of b), 

f) a polynucleotide complementary to a polynucleotide of c), and 

g) an RNA equivalent of a)-f). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

30 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, 
35 which probe specifically hybridizes to said target polynucleotide, under 
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conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 
b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction an^lification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28. 

19. A method for treating a disease or condition associated with decreased expression of 
functional PMOD, connprising administering to a patient in need of such treatment the composition of 
claim. 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional PMOD, comprising administering to a patient jn need of such treatment a composition of 
claim 21. 
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23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compoimd, and 

b) detecting antagonist activity in the sample. 

5 

24. A composition comprising an antagonist compound identified by a method of claim 23 
and a pharmaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of functional 
10 PMOD, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test coiiq)ound under suitable 
15 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test conapound, thereby 
identifying a compound that specifically binds to the polypeptide of claim L 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
20 claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
conditions permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

25 c) comparing the activity of the polypeptide of claim 1 in the presence of the test 

compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1 . 

30 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
35 conditions suitable for the expression of the target polynucleotide. 
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b) detecting altered expression of the target polynucleotide, and 

c) con5)aring the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test coiiq)ound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization conq)lex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization conq)Iex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A diagnostic test for a condition or disease associated with the expression of PMOD in a 
biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 1 1 , under conditions 
suitable for the antibody to bind die polypeptide and form an antibody:polypeptide 
complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 1 1, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient. 

33. A method of diagnosing a condition or disease associated with the expression of PMOD 
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in a subject, comprising administering to said subject an effective amount of the composition of claim 
32. 

34. A composition of claim 32, wherein the antibody is labeled. 

35. A method of diagnosing a condition or disease associated with the expression of PMOD 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
1 1 , the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 
polyclonal antibody which specifically binds to a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-28. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11 , the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-28, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-28. 
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40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier, 

5 42. The antibody of claim 1 1, wherein the antibody is produced by screening a Fab 

expression library. 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

10 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-28 in a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 
15 b) detecting specific binding, wherein specific binding indicates the presence of a 

polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-28 in the sample. 



45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
20 the group consisting of SEQ ID NO: 1-28 from a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
conq)rising an amino acid sequence selected from the group consisting of SEQ ID 

25 NO: 1-28. 



46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 



13. 



30 47. A method of generating an expression profile of a sample which contains 

polynucleotides, the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 

35 hybridization complex, and 
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c) quantifying the expression of trie polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a &st 

5 oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 contiguous 
nucleotides of a target polynucleotide, and wherein said target polynucleotide is a polynucleotide of 
claim 12. 

49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
10 completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oUgonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

15 51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

20 53. An array of claim 48, further comprising said target polynucleotide hybridized to a 

nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a Unker joins at least one of said nucleotide molecules to 
said solid substrate. 

25 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 

30 another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:1. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:2. 

35 
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58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:3. 

59. A polypeptide of claim I; comprising the amino acid sequence of SEQ ID N0:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO; 9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 11. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 
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76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ E) N0:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:29. 

85. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:30. 

86. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:31. 

87. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:32. 

88. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:34. 

90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
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NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:36. 

5 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

10 NO:38. 

94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:39. 

15 95. A polynucleotide of claim 12, conq)rising the polyiiucleotide sequence of SEQ ID 

NO:40. 

96. A polynucleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 

N0:41. 

20 

97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

25 NO:43. 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:44. . 

30 100. A polynucleotide of claim 12, conpising the polynucleotide sequence of SEQ ID 

NO:45. 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:46. 

35 
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102. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NQ:47. 

103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

5 NO:48. 

104. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:49. 

10 105. A polynucleotide of claim 12, comprising the pplynucleotide sequence of SEQ ID 

NO:50. 

106. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

N0:51. 

15 

107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:52. 

108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

20 NO:53. 

109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:54. 

25 1 10. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:55. 

1 1 1: A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:56. 
30 . 
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<150> US 60/315,992 
<151> 2001-08-29 

<150> US 60/378,205 
<151> 2002-05-03 

<160> 56 

<170> PERL Program 

<210> 1 

<211> 774 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature ^ 
<223> Incyte ID No: 7994355CD1 

<400> 1 

Met Ala Asp Gin His Arg" Ser Val Ser Glu Leu Leu Ser Asn Ser 
1 5 10 15 

Lys Phe Asp Val Asn Tyr Ala Phe Gly Arg Val Lys Arg Ser Leu 
20 25 30 

Leu His lie Ala Ala Asn Cys Gly Ser Val Glu Cys Leu Val Leu 
35 40 45 

Leu Leu Lys Lys Gly Ala Asn Pro Asn Tyr Gin Asp lie Ser Gly 
50 55 60 

Cys Thr Pro Leu His Leu Ala Ala Arg Asn Gly His Gly Gin Arg 
65 70 75 

Asp Thr Ala Gin lie Leu Leu Leu Arg Gly Ala Lys Tyr Leu Pro 
80 85 90 

Asp Lys Asn Gly Val Thr Pro Leu Asp Leu Cys Val Gin Gly Gly 
95 100 105 

Tyr Gly Glu Thr Cys Glu Val Leu lie Gin Tyr His Pro Arg Leu 

110 115 120 

Phe Gin Thr lie lie Gin Met Thr Gin Asn Glu Asp. Leu Arg Glu 

125 130 135 

Asn Met Leu Arg Gin Val Leu Glu His Leu Ser Gin Gin Ser Glu 

140 * 145 150 

Ser Gin Tyr Leu Lys He Leu Thr Ser Leu Ala Glu Val Ala Thr 

155 ' 160 165 

Thr Asn Gly His Lys Leu Leu Ser Leu Ser Ser Asn Tyr Asp Ala 

170 175 180 

Gin Met Lys Ser Leu Leu Arg He Val Arg Met Phe Cys His Val 

185 190 195 

Phe Arg He Gly Pro Ser Ser Pro Ser Asn Gly He Asp Met Gly 

200 205 210 

Tyr Asn Gly Asn Lys Thr Pro Arg Ser Gin Val Phe Lys Pro Leu 

215 220 225 

Glu Leu Leu Trp His Ser Leu Asp Glu Trp Leu Val Leu He Ala 

230 235 240 

Thr Glu Leu Met Lys Asn Lys Arg Asp Ser Thr Glu He Thr Ser 

245 250 255 

He Leu Leu Lys Gin Lys Gly Gin Asp Gin Asp Ala Ala Ser He 

260 265 270 

Pro Pro Phe Glu Pro Pro Gly Pro Gly Ser Tyr Glu Asn Leu Ser 

275 280 285 

Thr Gly Thr Arg Glu Ser Lys Pro Asp Ala Leu Ala Gly Arg Gin 

290 295 300 

Glu Ala Ser Ala Asp Cys Gin Asp Val He Ser Met Thr Ala Asn 

305 310 315 

Arg Leu Ser Ala Val He Gin Ala Phe Tyr Met Cys Cys Ser Cys 
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<210> 2 
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335 
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Glu 


Val 




350 
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His 
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Arg 
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Met 




380 




Glu 


Trp 
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395 




Met 
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Arg 
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Ser 
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440 




Glu 
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Gly 
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Asp 
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Ala 
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Asp 




560 




Ser 


Val 


Glu 
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Gly 
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Glu 
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Val 
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Gin 




620 
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Leu 
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Gly 
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755 




Tyr 


Gly 


Tyr 
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Asp 
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tsXn 


vsXy 


mec 


Pro 


Tyr 


inr 


Ser 


Trp 
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Ala 
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He 
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Met 
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Arg 


Thr 


Met 


Ala 



325 






Arg 
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He 


340 
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val 
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355 
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Leu 


Glu 


370 
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Ala 


Gin 


385 






Leu 


His 


Ser 


400 






Asn 


Glu 


Asn 


415 






Ser 


Ser 


Cys 


430 






Gin 


Gly 


He 


445 






Gly 


Val 


Val 


460 






Asn 


Pro 


Asp 


475 






Phe 


Gin 


Pro 


490 






Tyr 


Phe 


Arg 


505 






Arg 


Gin 


Leu 


520 






He 


Leu 


Gly 


535 






Pro 


Glu 


Tyr 


550 






Ser 


Asp 


Leu 


565 
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Gly 


Ala 


580 






He 


Leu 


Val 


595 






Thr 


Glu 


Leu 


610 






Phe 
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Gin 


625 






Leu 


Phe 


Asp 


640 






Glu 


He 


Asp 


655 






Gly 


Tyr 


Glu 


670 






Val 


Glu 


Asp 


685 






Val 


Thr 


Gly 


700 






Met 


Gly 


Gly 


715 






Tyr 


Thr 


Pro 


730 






Leu 


Lys 


Leu 


745 






Leu 


Leu 


Val 


760 
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Val 






345 


Arg 
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Pro 






360 
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Pro 


Glu 






375 


Pro 
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Gly 
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Pro 
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He 
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420 
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Val 
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570 


Met 


Glu 


Glu 
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615 
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660 
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720 


Asn 


Leu 


Leu 






735 


Pro 


Glu 
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<211> 703 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte 10 No: 7475875CD1 

<400> 2 

Met Ala Cys Ser Met Ala Cys Gly Gly Arg Ala Cys Lys Tyr Glu 
15 10 15 

Asn Pro Ala Arg Trp Ser Glu Gin Glu Gin Ala He Lys Gly Val 
20 25 ■ 30 

Tyr Ser Ser Trp Val Thr Asp Asn He Leu Ala Met Ala Arg Pro 
35 40 45 

Ser Ser Glu Leu Leu Glu Lys Tyr His He He Asp Gin Phe Leu 
50 55 60 

Ser His Gly He Lys Thr He He Asn Leu Gin Arg Pro Gly Glu 
65 70 75 

His Ala Ser Cys Gly Asn Pro Leu Glu Gin Glu Ser Gly Phe Thr 
80 85 90 

Tyr Leu Pro Glu Ala Phe Met Glu Ala Gly He Tyr Phe Tyr Asn 
95 . 100 105 

Phe Gly Trp Lys Asp Tyr Gly Val Ala Ser Leu Thr Thr He Leu 

110 115 120 

Asp Met Val Lys Val Met Thr Phe Ala Leu Gin Glu Gly Lys Val 

125 130 135 

Ala He His Cys His Ala Gly Leu Gly Arg Thr Gly Val Leu He 

140 145 150 

Ala Cys Tyr Leu Val Phe Ala Thr Arg Met Thr Ala Asp Gin Ala 

155 160 165 

He He Phe Val Arg Ala Lys Arg Pro Asn Ser He Gin Thr Arg 

170 175 180 

Gly Gin Leu Leu Cys Val Arg Glu Phe Thr Gin Phe Leu Thr Pro 

185 190 195 

Leu Arg Asn He Phe Ser Cys Cys Asp Pro Lys Ala His Ala Val 

200 205 210 

Thr Leu Pro Gin Tyr Leu He Arg Gin Arg His Leu Leu His Gly 

215 220 225 

Tyr Glu Ala Arg Leu Leu Lys His Val Pro Lys He He His Leu 

230 235 240 

Val Cys Lys Leu Leu Leu Asp Leu Ala Glu Asn Arg Pro Val Met 

245 250 255 

Met Lys Asp Val Ser Glu Gly Pro Gly Leu Ser Ala Glu He Glu 

260 265 270 

Lys Thr Met Ser Glu Met Val Thr Met Gin Leu Asp Lys Glu Leu . 

275 280 285 

Leu Arg His Asp Ser Asp Val Ser Asn Pro Pro Asn Pro Thr Ala 

290 295 300 

Val Ala Ala Asp Phe Asp Asn Arg Gly Met He Phe Ser Asn Glu 

305 310 315 

Gin Gin Phe Asp Pro Leu Trp Lys Arg Arg Asn Val Glu Cys Leu 

320 325 330 

Gin* Pro Leu Thr His Leu Lys Arg Arg Leu Ser Tyr Ser Asp Ser 

335 340 345 

Asp Leu Lys Arg Ala Glu Asn Leu Leu Glu Gin Gly Glu Thr Pro 

350 355 360 

Gin Thr Val Pro Ala Gin He Leu Val Gly His Lys Pro Arg Gin 

365 370 375 

Gin Lys Leu He Ser His Cys Tyr He Pro Gin Ser Pro Glu Pro 

380 385 390 

Asp Leu His Lys Glu Ala Leu Val Arg Ser Thr Leu Ser Phe Trp 

395 400 405 
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<210> 3 

<211> 1256 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 71231882CD1 

<400> 3 
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Arg He He Pro Leu Gin Leu Gin Arg Leu Phe Ala Gin Leu Leu 
. 95 100 105 

Leu Leu Asp Gin Glu Ala Ala Ser Thr Ala Asp Leu Thr Asp Ser 
110 115 120 

Phe Gly Trp Thr Ser Asn Glu Glu Met Arg Gin His Asp Val Gin 
125 130 135 

Glu Leu Asn Arg He Leu Phe Ser Ala Leu Glu Thr Ser Leu Val 
140 145 150 

Gly Thr Ser Gly His Asp Leu He Tyr Arg Leu Tyr His Gly Thr 
155 160 165 

He Val Asn Gin He Val Cys Lys Glu Cys Lys Asn Val Ser Glu 
170 175 180 

Arg Gin Glu Asp Phe Leu Asp Leu Thr Val Ala Val Lys Asn Val 
185 190 195 

Ser Gly Leu Glu Asp Ala Leu Trp Asn Met Tyr Val Glu Glu Glu 
200 205 210 

Val Phe Asp Cys Asp Asn Leu Tyr His Cys Gly Thr Cys Asp Arg 
215 220 225 

Leu Val Lys Ala Ala Lys Ser Ala Lys Leu Arg Lys Leu Pro Pro 
230 235 240 

Phe Leu" Thr Val Ser Leu Leu Arg Phe Asn Phe Asp Phe Val Lys 
245 250 255 

Cys Glu Arg Tyr Lys Glu Thr Ser Cys Tyr Thr Phe Pro . Leu Arg 
260 ' 265 270 

He Asn Leu Lys Pro Phe Cys Glu Gin Ser Glu Leu Asp Asp Leu 
275 280 285 

Glu Tyr He Tyr Asp Leu Phe Ser Val He He His Lys Gly Gly 
290. 295 300 

Cys Tyr Gly Gly His Tyr His Val Tyr He Lys Asp Val Asp His 
305 310 315 

Leu Gly Asn Trp Gin Phe Gin Glu Glu Lys Ser Lys Pro Asp Val 
320 325 330 

Asn Leu Lys Asp Leu Gin Ser Glu Glu Glu He Asp His Pro Leu 
335 340 345 

Met He Leu Lys Ala He Leu Leu Glu Glu Glu Asn Asn Leu He 
350 355 360 

Pro Val Asp Gin Leu Gly Gin Lys Leu Leu Lys Lys He Gly He 
365 370 375 

Ser Trp Asn Lys Lys Tyr Arg Lys Gin His Gly Pro Leu Arg Lys 
380 385 390 

Phe Leu Gin Leu His Ser Gin He Phe Leu Leu Ser Ser Asp Glu 
395 400 405 

Ser Thr Val Arg Leu Leu Lys Asn Ser Ser Leu Gin Ala Glu Ser 
410 415 420 

Asp Phe Gin Arg Asn Asp. Gin Gin He Phe Lys Met Leu Pro Pro 
425 430 435 

Glu Ser Pro Gly Leu Asn Asn Ser He Ser Cys Pro His Trp Phe 
440 445 450 

Asp lie Asn Asp Ser Lys Val Gin Pro He Arg Glu Lys Asp He 
455 460 465 

Glu Gin Gin Phe Gin Gly Lys Glu Ser Ala Tyr Met Leu Phe Tyr 
470 • 475 , 480 

Arg Lys Ser Gin Leu Gin Arg Pro Pro Glu Ala Arg Ala Asn Pro 
485 490 495 

Arg Tyr Gly Val Pro Cys His Leu Leu Asn Glu Met Asp Ala Ala 
500 505 510 

Asn He Glu Leu Gin Thr Lys Arg Ala Glu Cys Asp Ser Ala Asn 
515 520 525 

Asn Thr Phe Glu Leu His Leu His Leu Gly Pro Gin Tyr His Phe 
530 535 540 

Phe Asn Gly Ala Leu His Pro Val Val Ser Gin Thr Glu Ser Val 
545 550 555 

Trp Asp Leu Thr Phe Asp Lys Arg Lys Thr Leu Gly Asp Leu Arg 
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940 


Leu 


Leu 


He 


Glu 


Gly 
945 


Gin 


Leu 


Pro 


Pro 


Leu 
950 


Gly 


Phe 


Leu 


Lys 


Val 
955 


Pro 


He 


Trp 


Trp 


Tyr 
960 


Gin 


Leu 


Gin 


Gly 


Pro 
965 


Ser 


Gly 


His 


Trp 


Glu- 
970 


Ser 


His 


Gin 


Asp 


Gin 
975 


Thr 


Asn 


Cys 


Thr 


Ser 

980 


Ser 


Trp 


Gly 


Arg 


Val 
985 


Trp 


Arg 


Ala 


Thr 


Ser 

990 


Ser 


Gin 


Gly 


Ala 


Ser 


Gly 


Asn 


Glu 


Pro 


Ala 


Gin 


Val 


Ser 


Leu 


Leu 










995 








1000 








1005 


Tyr 


Leu 


Gly 


Asp 


He 


Glu 


He 


Ser 


Glu Asp Ala 


Thr 


Leu 


Ala 


Glu 








1010 








1015 








1020 


Leu 


Lys 


Ser 


Gin Ala 


Met 


Thr 


Leu 


Pro 


Pro 


Phe 


Leu 


Glu 


Phe Gly 








1025 








1030 








1035 
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Val Pro Ser Pro Ala His Leu Arg Ala Trp Thr Val Glu Arg Lys 

1040 1045 1050 

Arg Pro Gly Arg Leu Leu Arg Thr Asp Arg Gin Pro Leu Arg Glu 

1055 1060 1065 

Tyr Lys Leu Gly Arg Arg lie Glu lie Cys Leu Glu Pro Leu Gin 

1070 1075 1080 

Lys Gly Glu Asn Leu Gly Pro Gin Asp Val Leu Leu Arg Thr Gin 

1085 1090 1095 

Val Arg He Pro Gly Glu Arg Thr Tyr Ala Pro Ala Leu -Asp Leu 

1100 1105 1110 

Val Trp Asn Ala Ala Gin Gly Gly Thr Ala Gly Ser Leu Arg Gin 

1115 1120 1125 

Arg Val Ala Asp Phe Tyr Arg Leu Pro Val Glu Lys He Glu He 

1130 1135 1140 

Ala Lys Tyr Phe Pro Glu Lys Phe Glu Trp Leu Pro He Ser Ser 

1145 1150 1155 

Trp Asn Gin Gin He Thr Lys Arg Lys Lys Lys Lys Lys Gin Asp 

1160 1165 1170 

Tyr Leu Gin Gly Ala Pro Tyr Tyr Leu Lys Asp Gly Asp Thr He 

H75 1180 1185 

Gly Val Lys Asn Leu Leu He Asp Asp Asp Asp Asp Phe Ser Thr 

1190 1195 1200 

He Arg Asp Asp Thr Gly Lys Glu Lys Gin Lys Gin Arg Ala Leu 

1205 1210 1215 

Gly Arg Arg Lys Ser Gin Glu Ala Leu His Glu Gin Ser Ser Tyr 

1220 1225 1230 

He Leu Ser Ser Ala Glu Thr Pro Ala Arg Pro Arg Ala Pro Glu 

1235 1240 1245 

Thr Ser Leu Ser He His Val Gly Ser Phe Arg 

1250 1255 

<210> 4 
<211> 755 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> inisc_fe^ture 

<223> Incyte ID No: 2B75922CD1 

<400> 4 

Met Lys Lys Gin Arg Lys He Leu Trp Arg Lys Gly He His Leu 
15 10 15 

Ala Phe Ser Glu Lys Trp Asn Thr Gly Phe Gly Gly Phe Lys Lys 
20 25 30 

Phe Tyr Phe His Gin His Leu Cys He Leu Lys Ala Lys Leu Gly 
35 40 45 

Arg Pro Val Thr Trp Asn Arg Gin Leu Arg His Phe Gin Gly Arg 
50 55 60 

Lys Lys Ala Leu Gin He Gin Lys Thr Trp He Lys Asp Glu Pro 
65 70 75 

Leu Cys Ala Lys Thr Lys Phe Asn Val Ala Thr Gin Asn Val Ser 
80 85 90 

Thr Leu Ser Ser Lys Val Lys Arg Lys Asp Ala Lys His Phe He 
95 100 105 

Ser Ser Ser Lys Thr Leu Leu Arg Leu Gin Ala Glu Lys Leu Leu 
110 115 120 

Ser Ser Ala Lys Asn Ser Asp His Glu Tyr Cys Arg Glu Lys Asn 
125 130 135 

Leu Leu Lys Ala Val Thr Asp Phe Pro Ser Asn Ser Ala Leu Gly 
140 145 150 

Gin Ala Asn Gly His Arg Pro Arg Thr Asp Pro Gin Pro Ser Asp 
155 160 165' 
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Phe 


Pro 


Met 


Gly 


Thr 


He 


Cys 


Tyr 


Gly 


Pro 


Leu 


He 


Lys 


Leu 


Ser 


Phe 


Arg 


Tyr 


Lys 


Val 


Cys 


Val 


Thr 


Gly 


Gly 


Ser 


Cys 


Cys 


Arg 


His 


Val 


Lys 


Gly 


Ser 


Ser 


Phe 


Asp 


Gin 


Gin 


Cys 


Ser 


Ser 


Pro 


Leu 


Pro 


Glu 


Arg 


Glu 


Val 


Ser 


Asp 


Ser 


Val 


Ser 


Asn 


Glu 


Asn 


Ser 


He 


Leu 


Pro 


Leu 


Glu 


Gin 


Val 


Gly 


Asp 


Glu 


Gin 


Met 


Lys 


Lys 


Leu 


Gly 


Arg 


Lys 


Pro 


Phe 


Gin 


Lys 


Cys 


Met 


Asp 


Asp 


Gin 


Val 


He 


Asp 


Lys 


Val 


Thr 


Lys 


Gly 


Leu 


Phe 


Lys 



Lys 


Phe 


Asn 




170 




Val 


Val 


Thr 




185 




Cys 


Cys 


Gin 




200 




Pro 


Lys 


Lys 




215 




Pro 


Leu 


Met 




230 




Arg 


He 


Leu 




245 




Lys 


Leu 


Arg 




260 




Asp 


His 


Gin 




275 




Ser 


Pro 


Phe 




290 




Gin 


Pro 


Tyr 




305 




Thr 


Asn 


Ser 




320 




Leu 


Gly 


Lys 




335 




Asn 


Gly 


Ser 




350 




Pro 


Lys 


Trp 




365 




Glu 


His 


Arg 




380 




He 


Met 


Thr 




395 




Asp 


Arg 


Val 




410 




Ser 


Val 


Asp 




425 




Ser 


Tyr 


Gin 




440 




Ser 


Ser 


Glu 




455 




Ala 


Pro 


Leu 




470 




Gly 


Gly 


Lys 




485 




Leu 


Ser 


Val 




500 




Tyr 


Gly 


Ser 




515 




Leu 


Lys 


Asp 




530 




He 


Asn 


Arg 




545 




Asn 


Phe 


Arg 




560 




Leu 


Ala 


Thr 




575 




Asn 


Met 


Tyr 




590 




His 


Phe 


Phe 




605 




Tyr 


Asn 


Gly 




620 




Lys 


Ser 


Leu 



Gly 


Glu 


Ser 


Leu 


Asn 


Asn 


Gly 


Pro 


Glu 


Phe" 


Gin 


Leu 


Met 


Tyr 


Glu 


Arg 


Ser 


Gin 


Lys 


Ala 


Gin 


Glu 


Thr 


Arg 


Pro 


Ser 


Pro 


Phe 


Pro 


Asp 


His 


Val 


Pro 


Glu 


Leu 


Ser 


Ala 


Thr 


Asn 


Glu 


Cys 


Thr 


Ser 


Asn 


Thr 


Leu 


Gly 


Gin 


Lys 


Leu 


Ser 


Gly 


Pro 


Val 


Met 


Glu 


Glu 


Leu 


Leu 


Asp 


Val 


Cys 


Ser 


Asn 


Ser 


Gin 


Cys 


Leu 


Ser 


Leu 


Val 


Pro 


Val 


Phe 


Asn 


Glu 


He 


Thr 


He 


Phe 


Tyr 


Leu 


Asp 


Gly 


Gly 


Glu 


Leu 


Asn 


Ser 


Phe 


Val 


Lys 


Arg 


Leu 


Leu 


He 



Gin 


Ser 


Pro 


175 






His 


Lys 


Arg 


190 






His 


His 


Arg 


205 






Asn 


Gin 


His 


220 






Lys 


Leu 


Ser 


235 






His 


Phe 


Arg 


250 






Arg 


Ser 


Trp 


265 






Arg 


Glu 


Asn 


280 






Glu 


Pro 


Lys 


295 






Met 


Asp 


Ser 


310 






Asp 


Cys 


His 


325 






Leu 


Asp 


Glu 


340 






Ala 


Trp 


Asp 


355 






Glu 


Leu 


He 


370 






Met 


Phe 


He 


385 






Glu 


Asn 


Gin 


400 






Val 


Ser 


Gly 


415 






Ser 


Gin 


Lys 


430 






Asp 


Gly 


Ser 


445 






His 


Pro 


Tyr 


460 






Gly 


Leu 


Lys 


475 






Lys 


Ala 


Ser 


490 






Gly 


Phe 


Leu 


505 






Leu 


Ser 


Glu 


520 






Glu 


Asp 


Phe 


535 






Asn 


Tyr 


Arg 


550 






Asn 


Lys 


His 


565 






Gin 


Asn 


Trp 


580 






He 


Met 


Asp 


595 






Phe 


His 


Arg 


610 






Trp 


Thr 


Lys 


625 






Pro 


He 


His 



9/59 



Gly 


Glu 


Ser 






180 


Lys 


Gly 


Phe 






195 


Asn 


Gly 


Gly 






210 


Arg 


Arg 


He 






225 


Met 


He 


Arg 






240 


Thr 


Lys 


Ser 






255 


Val 


Gin 


Lys 






270 


Gly 


Glu 


Gly 






285 


Asp 


Pro 


Ser 






300 


Ser 


Ala 


Val 






315 


Thr 


Lys 


Gly 






330 


Ala 


Phe 


Pro 






345 


Gin 


Ser 


Ser 






360 


His 


Asp 


He 






375 


Ser 


Glu 


Thr 






390 


Thr 


Ser 


Ser 






405 


Ala 


Asp 


Thr 






420 


Ala 


Val 


Gin 






435 


Leu 


Lys 


Gin 






450 


Cys 


Lys 


Ser 






465 


Leu 


Glu 


Asn 






480 


Pro 


Val 


Asp 






495 


Asp 


Glu 


Val 






510 


Lys 


Glu 


Val 






525 


Ser 


Asn 


Arg 






540 


Ala 


Arg 


His 






555 


Met 


Leu 


Asp 






570 


Leu 


Asn 


Asp 






585 


Ala 


Val 


Pro 






600 


Gin 


Leu 


Val 






615 


Lys 


Val 


Asp 






630 


Leu 


Glu 


Val 
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635 




640 




645 


His 


Trp 


Ser 


Leu 


lie Thr Val 


Thr 


Leu Ser Asn Arg 


He He Ser 








650 




655 




660 


Phe 


Tyr 


Asp 


Ser 


Gin Gly lie 


His 


Phe Lys 


Phe Cys Val Glu Asn 








665 




670 




675 


lie 


Arg 


Lys 


Tyr 


Leu Leu Thr 


Glu 


Ala Arg 


Glu Lys 


Asn Arg Pro 








680 




685 




690 


Glu 


Phe 


Leu 


Gin 


Gly Trp Gin 
695 


Thr 


Ala Val 
700 


Thr Lys 


Cys He Pro 
705 


Gin 


Gin 


Lys 


Asn 


Asp Ser Asp 
710 


Cys 


Gly Val 

715 


Phe Val 


Leu Gin Tyr 
720 


Cys 


Lys 


Cys 


Leu 


Ala Leu Glu 


Gin 


Pro Phe 


Gin Phe 


Ser Gin Glu 






725 




730 




735 


Asp 


Met 


Pro 


Arg 


Val Arg Lys 


Arg 


lie Tyr 


Lys Glu 


Leu Cys Glu 








740 




745 




750 


Cys 


Arg 


Leu 


Met 


Asp 
755 











<210> 5 

<211> 1034 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 8158136CD1 

<400> 5 

Met Ala Pro Arg Leu Gin Leu Glu Lys Ala Ala Trp Arg Trp Ala 
15 10 15 

Glu Thr Val Arg Pro Glu Glu Val Ser Gin Glu His He Glu Thr 
20 25 30 

Ala Tyr Arg He Trp Leu Glu Pro Cys He Arg Gly Val Cys Arg 
35 40 45 

Arg Asn Cys Lys Gly Asn Pro Asn Cys Leu Val Gly He Gly Glu 
50 55 60 

His He Trp Leu Gly Glu He Asp Glu Asn Ser Phe His Asn He 
65 70 75 

Asp Asp Pro Asn Cys Glu Arg Arg Lys Lys Asn Ser Phe Val Gly 
80 85 90 

Leu Thr Asn Leu Gly Ala Thr Cys Tyr Val Asn Thr- Phe Leu Gin 
95 100 105 

Val Trp Phe Leu Asn Leu Glu Leu Arg Gin Ala Leu Tyr Leu Cys 
110 115 120 

Pro Ser Thr Cys Ser Asp Tyr Met Leu Gly Asp Gly He Gin Glu 
125 130 135 

Glu Lys Asp Tyr Glu Pro Gin Thr He Cys Glu His Leu Gin Tyr 
140 145 150 

Leu Phe Ala Leu Leu Gin Asn Ser Asn Arg Arg Tyr He Asp Pro 
155 160 165 

Ser Gly Phe Val Lys Ala Leu Gly Leu Asp Thr Gly Gin Gin Gin 
170 175 180 

Asp Ala Gin Glu Phe Ser Lys Leu Phe Met Ser Leu Leu Glu Asp 
• . 185 190 195 

Thr Leu Ser Lys Gin Lys Asn Pro Asp Val Arg Asn He Val Gin 
200 205 210 

Gin Gin Phe Cys Gly Glu Tyr Ala -Tyr Val Thr Val Cys Asn Gin 
215 220 225 

Cys Gly Arg Glu Ser Lys Leu Leu Ser Lys Phe Tyr Glu Leu Glu 
230 235 240 

Leu Asn He Gin Gly His Lys Gin Leu Thr Asp Cys He Ser Glu 
245 250 255 

Phe Leu Lys Glu Glu Lys Leu Glu Gly Asp Asn Arg Tyr Phe Cys 
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260 










265 










270 


Glu 


Asn 


Cys 


Gin 


Ser 
275 


Lys 


Gin 


Asn 


Ala 


Thr 
280 


Arg 


Lys 


He 


Arg 


Leu 
285 


Leu 


Ser 


Leu 


Pro 


Cys 
290 


Thr 


Leu 


Asn 


Leu 


Gin 
295 


Leu 


Met 


Arg 


Phe 


Val 
300 


Phe 


Asp 


Arg 


Gin 


Thr 
305 


Gly 


His 


Lys 


Lys 


Lys 
310 


Leu 


Asn 


Thr 


Tyr 


He 
315 


Gly 


Phe 


Ser 


Glu 


He 
320 


Leu 


Asp 


Met 


Glu 


Pro 
325 


Tyr 


Val 


Glu 


His 


Lys 
330 


Gly 


Gly 


Ser 


Tyr 


Val 

335 


Tyr 


Glu 


Leu 


Ser 


Ala 

340 


Val 


Leu 


He 


His 


Arg 

345 


Gly 


Val 


Ser 


Ala 


Tyr 
350 


Ser 


Gly 


His 


Tyr 


He 
355 


Ala 


His 


Val 


Lys 


Asp 
360 


Pro 


Gin 


Ser 


Gly 


Glu 
365 


Trp 


Tyr 


Lys 


Phe 


Asn 
370 


Asp 


Glu 


Asp 


He 


Glu 
375 


Lys 


Met 


Glu 


Gly 


Lys 
380 


Lys 


Leu 


Gin 


Leu 


Gly 
385 


He 


Glu 


Glu 


Asp 


Leu 
390 


Ala 


Glu 


Pro 


Ser 


Lys 
395 


Ser 


Gin 


Thr 


Arg 


Lys 
400 


Pro 


Lys 


Cys 


Gly 


Lys 
405 


Gly 


Thr 


His 


Cys 


Ser 
410 


Arg 


Asn 


Ala 


Tyr 


Met 
415 


Leu 


Val 


Tyr 


Arg 


Leu 
420 


Gin 


Thr 


Gin 


Glu 


Lys 
425 


Pro 


Asn 


Thr 


Thr 


Val 
430 


Gin 


Val 


Pro 


Ala 


Phe 
435 


Leu 


Gin 


Glu 


Leu 


Val 
440 


Asp 


Arg 


Asp 


Asn 


Ser 
445 


Lys 


Phe 


Glu 


Glu 


Trp 
450 


Cys 


He 


Glu 


Met 


Ala 
455 


Glu 


Met 


Arg 


Lys 


Gin 
460 


Ser 


Val 


Asp 


Lys 


Gly 
465 


Lys 


Ala 


Lys 


His 


Glu 

470 


Glu 


Val 


Lys 


Glu 


Leu 
'475 


Tyr 


Gin 


Arg 


Leu 


Pro 
480 


Ala 


Gly 


Ala 


Glu 


Pro 
485 


Tyr 


Glu 


Phe 


Val 


Ser 
490 


Leu 


Glu 


Trp 


Leu 


Gin 
495 


Lys 


Trp 


Leu 


Asp 


Glu 
500 


Ser 


Thr 


Pro 


Thr 


Lys 
505 


Pro 


He 


Asp 


Asn 


His 
510 


Ala 


Cys 


Leu 


Cys 


Ser 
515 


His 


Asp 


Lys 


Leu 


His 
520 


Pro 


Asp 


Lys 


He 


Ser 
525 


He 


Met 


Lys 


Arg 


He 
530 


Ser 


Glu 


Tyr 


Ala 


Ala 
535 


Asp 


He 


Phe 


Tyr 


Ser 
540 


Arg 


Tyr 


Gly 


Gly 


Gly 
545 


Prd 


Arg 


Leu 


Thr 


Val 
550 


Lys 


Ala 


Leu 


Cys 


Lys 
555 


Glu 


Cys 


Val 


Val 


Glu 
560 


Arg 


Cys 


Arg 


He 


Leu 
565 


Arg 


Leu 


Lys 


Asn 


Gin 

570 


Leu 


Asn 


Glu 


Asp 


Tyr 
575 


Lys 


Thr 


Val 


Asn 


Asn 
580 


Leu 


Leu 


Lys 


Ala 


Ala 
585 


Val 


Lys 


Gly 


Asp 


Gly 
590 


Phe 


Trp 


Val 


Gly 


Lys 
595 


Ser 


Ser 


Leu 


Arg 


Ser 
600 


Trp 


Arg 


Gin 


Leu 


Ala 
605 


Leu 


Glu 


Gin 


Leu 


Asp 
610 


Glu 


Gin 


Asp 


Gly 


Asp 
615 


Ala 


Glu 


Gin 


Ser 


Asn 
620 


Gly 


Lys 


Met 


Asn 


Gly 
625 


Ser 


Thr 


Leu 


Asn 


Lys 
630 


Asp 


Glu 


Ser 


Lys 


Glu 
635 


Glu 


Arg 


Lys 


Glu 


Glu 
640 


Glu 


Glu 


Leu 


Asn 


Phe 
645 


Asn 


Glu 


Asp 


He 


Leu 
650 


Cys 


Pro 


His 


Gly 


Glu 
655 


Leu 


Cys 


He 


Ser 


Glu 
660 


Asn 


Glu 


Arg 


Arg 


Leu 

665 


Val 


Ser 


Lys 


Glu 


Ala 

670 


Trp 


Ser 


Lys 


Leu 


Gin 

675 


Gin 


Tyr 


Phe 


Pro 


Lys 
680 


Ala 


Pro 


Glu 


Phe 


Pro 
685 


Ser 


Tyr 


Lys 


Glu 


Cys 
690 


Cys 


Ser 


Gin 


Cys 


Lys 
695 


He 


Leu 


Glu 


Arg 


Glu 
700 


Gly 


Glu 


Glu 


Asn 


Glu 
705 


Ala 


Leu 


His 


Lys 


Met 
710 


He 


Ala 


Asn 


Glu 


Gin 
715 


Lys 


Thr 


Ser 


Leu 


Pro 
720 


Asn 


Leu 


Phe 


Gin 


Asp 
725 


Lys 


Asn 


Arg 


Pro 


Cys 
730 


Leu 


Ser 


Asn 


Trp 


Pro 
735 



11/59 



wo 03/000844 



PCT/US02/19360 



Glu Asp Thr Asp Val Leu Tyr He Val Ser Gin Phe Phe Val Glu 

740 745 750 

Glu Trp Arg Lys Phe Val Arg Lys Pro Thr Arg Cys Ser Pro Val 

755 760 765 

Ser Ser Val Gly Asn Ser Ala Leu Leu Cys Pro His Gly Gly Leu 

770 775 780 

Met Phe Thr Phe Ala Ser Met Thr Lys Glu Asp Ser Lys Leu He 

785 790 795 

Ala Leu He Trp Pro Ser Glu Trp Gin Met He Gin Lys Leu Phe 

800 805 810 

Val Val Asp His Val He Lys He Thr Arg He Glu Val Gly Asp 

815 820 825 

Val Asn Pro Ser Glu Thr Gin Tyr He Ser Glu Pro Lys Leu Cys 

830 835 840 

Pro Glu Cys Arg Glu Gly Leu Leu Cys Gin Gin Gin Arg Asp Leu 

845. 850 855 

Arg Glu Tyr Thr Gin Ala Thr He Tyr Val His Lys Val Val Asp 

860 865 870 

Asn Lys Lys Val Met Lys Asp Ser Ala Pro Glu Leu Asn Val Ser 

875 880 885 

Ser Ser Glu Thr Glu Glu Asp Lys Glu Glu Ala Lys Pro Asp Gly 

890 895 900 

Glu Lys Asp Pro Asp Phe Asn Gin Ser Asn Gly Gly Thr Lys Arg 

905 910 915 

Gin Lys He Ser His Gin Asn Tyr He Ala Tyr Gin Lys Gin Val 

920 925 930 

He Arg Arg Ser Met Arg His Arg Lys Val Arg Gly Glu Lys Ala 

935 940 945 

Leu Leu Val Ser Ala Asn Gin Thr Leu Lys Glu Leu Lys He Gin 

950 955 960 

He Met His Ala Phe Ser Val Ala Pro Phe Asp Gin Asn Leu Ser 

965 970 975 

He Asp Gly Lys He Leu Ser Asp Asp Cys Ala Thr Leu Gly Thr 

980 985 990 

Leu Gly Val He Pro Glu Ser Val He Leu Leu Lys Ala Asp Glu 

995 1000 1005 

Pro He Ala Asp Tyr Ala Ala Met Asp Asp Val Met Gin Val Cys 
1010 1015 1020 

Met Pro Glu Glu Gly Phe Lys Gly Thr Gly Leu Leu Gly His 
1025 1030 



<210> 6 

<211> 1236 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5969491CD1 

<400> 6 



Met 


Phe 


Gly Asp 


Leu 


Phe 


Glu, Glu Glu 


Tyr 


Ser Thr Val 


Ser 


Asn 


1 






5 










10 








15 


Asn 


Gin 


Tyr Gly 


Lys 

20 


Gly 


Lys 


Lys 


Leu 


Lys 

25 


Thr Lys 


Ala 


Leu 


Glu 
30 


Pro 


Pro 


Ala Pro 


Arg 
35 


Glu 


Phe 


Thr 


Asn 


Leu 
40 


Ser Gly 


He 


Arg 


Asn 

45 


Gin Gly 


Gly Thr 


Cys 


Tyr 


Leu 


Asn 


Ser 


Leu 


Leu Gin 


Thr 


Leu 


His 








50 










55 








60 


Phe 


Thr 


Pro Glu 


Phe 
65 


Arg 


Glu 


Ala 


Leu 


Phe 
70 


Ser Leu 


Gly 


Pro 


Glu 
75 


Glu 


Leu 


Gly Leu 


Phe 
80 


Glu 


Asp 


Lys 


Asp 


Lys 
85 


Pro Asp 


Ala 


Lys 


Val 
90 
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Arg 


He 


He 


Leu 


Leu 


Asp 


Phe 


Gly 


Trp 


Glu 


Leu 


Asn 


Gly 


Thr 


Ser 


He 


Val 


Asn 


Arg 


Gin 


Glu 


Ser 


Gly 


Leu 


Val 


Phe 


Asp 


Leu 


Val 


Lys 


Phe 


Leu 


Thr 


Cys 


Glu 


Arg 


He 


Asn 


Leu 


Glu 


Tyr 


He 


Cys 


Tyr 


Gly 


Leu 


Gly 


Asn 


Asn 


Leu 


Lys 


Met 


He 


Leu 


Pro 


Val 


Asp 


Ser 


Trp 


Asn 


Phe 


Leu 


Gin 


Ser 


Thr 


Val 


Asp 


Phe 


Gin 


Glu 


Ser 


Pro 


Asp 


He 


Asn 


Glu 


Gin 


Gin 


Arg 


Lys 


Ser 


Arg 


Tyr 


Gly 


Asn 


He 


Glu 


Asn 


Thr 


Phe 


Phe 


Asn 


Gly 


Trp 


Asp 


Leu 



Pro 


Leu 


Gin 




95 




Gin 


Glu 


Ala 




110 




Thr 


Ser 


Asn 




125 




Arg 


He 


Leu 




140 




Gly 


His 


Asp 




155 




Gin 


He 


Val 




170 




Asp 


Phe 


Leu 




185 




Glu 


Asp 


Ala 




200 




Cys 


Asp 


Asn 




215 




Ala 


Ala 


Lys 




230 




Val 


Ser 


Leu 




245 




Tyr 


Lys 


Glu 




260 




Lys 


Pro 


Phe 




275 




Tyr 


Asp 


Leu 




290 




Gly 


His 


Tyr 




305 




Trp 


Gin 


Phe 




320 




Asp 


Leu 


Gin 




335 




Lys 


Ala 


He 




350 




Gin 


Leu 


Gly 




365 




Lys 


Lys 


Tyr 




380- 




Leu 


His 


Ser 




395 




Arg 


Leu 


Leu 




410 




Arg 


Asn 


Asp 




425 




Gly 


Leu 


Asn 




440 




Asp 


Ser 


Lys 




455 




Phe 


Gin 


Gly 




470 




Gin 


Leu 


Gin 




485 




Val 


Pro 


Cys 




500 




Leu 


Gin 


Thr 




515 




Glu 


Leu 


His 




530 




Ala 


Leu 


His 




545 




Thr 


Phe 


Asp 



Leu Gin Arg 
Ala Ser Thr 
Glu Glu Met 
Phe Ser Ala 
Leu He Tyr 
Cys Lys Glu 
Asp Leu Thr 
Leu Trp Asn 
Leu Tyr His 
Ser Ala Lys 
Leu Arg Phe 
Thr Ser Cys 
Cys Glu Gin 
Phe Ser Val 
His Val Tyr 
Gin Glu Glu 
Ser Glu Glu 
Leu Leu Glu 
Gin Lys Leu 
Arg Lys Gin 
Gin He Phe 
Lys Asn Ser 
Gin Gin He 
Asn Ser He 
Val Gin Pro 
Lys Glu Ser 
Arg Pro Pro 
His Leu Leu 
Lys Arg Ala 
Leu His Leu 
Pro Val Val 
Lys Arg Lys 



Leu 


Phe 


Ala 


100 






Ala Asp 


Leu 


115 






Arg Gin 


His 


130 






Leu 


Glu 


Thr 


145 






Arg Leu 


Tyr 


160 






Cys 


Lys 


Asn 


175 






Val 


Ala 


Val 


190 






Met 


Tyr 


Val 


205. 






Cys 


Gly 


Thr 


220 






Leu 


Arg 


Lys 


235 






Asn 


Phe 


Asp 


250 






Tyr 


Thr 


Phe 


265 






Ser 


Glu 


Leu 


280 






He 


He 


His 


295 






He 


Lys 


Asp 


310 






Lys 


Ser 


Lys 


325 






Glu 


He 


Asp 


340 






Glu 


Glu 


Asn 


355 






Leu 


Lys 


Lys 


370 






His 


Gly 


Pro 


385 






Leu 


Leu 


Ser 


400 






Ser 


Leu 


Gin 


415 




* 


Phe 


Lys 


Met 


430 






Ser 


Cys 


Pro 


445 






He 


Arg 


Glu 


460 






Ala 


Tyr 


Met 


475 






Glu 


Ala 


Arg 


490 






Asn 


Glu 


Met 


505 






Glu 


Cys 


Asp 


520 






Gly 


Pro 


Gin 


535 






Ser 


Gin. 


Thr 


550 






Thr 


Leu 


Gly 



Gin 


Leu 


Leu 






105 


Thr 


Asp 


Ser 






120 


Asp 


Val 


Gin 






135 


Ser 


Leu 


Val 






150 


His 


Gly 


Thr 






165 


Val 


Ser 


Glu 






180 


Lys 


Asn 


Val 






195 


Glu 


Glu 


Glu 






210 


Cys 


Asp 


Arg 






225 


Leu 


Pro 


Pro 






240 


Phe 


Val 


Lys 






255 


Pro 


Leu 


Arg 






270 


Asp 


Asp 


Leu 






285 


Lys 


Gly 


Gly 






300 


Val 


Asp 


His 






315 


Pro 


Asp 


Val 






330 


His 


Pro 


Leu 






345 


Asn 


Leu 


He 






360 


He 


Gly 


He 






375 


Leu 


Arg 


Lys 






390 


Ser 


Asp 


Glu 






405 


Ala 


Glu 


Ser 






420 


Leu 


Pro 


Pro 






435 


His 


Trp 


Phe 






450 


Lys 


Asp 


He 






465 


Leu 


Phe 


Tyr 






480 


Ala 


Asn 


Pro 






495 


Asp 


Ala 


Ala 






510 


Ser 


Ala 


Asn 






525 


Tyr 


His 


Phe 






540 


Glu 


Ser 


Val 






555 


Asp 


Leu 


Arg 
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560 565 570 

Gin Ser lie Phe Gin Leu Leu Glu Phe Trp Glu Gly Asp Met Val 
575 580 585 

Leu Ser Val Ala Lys Leu Val Pro Ala Gly Leu His He Tyr Gin 
590 595 600 

Ser Leu Gly Gly Asp Glu Leu Thr Leu Cys Glu Thr Glu He Ala 
605 610 615 

Asp Gly Glu Asp He Phe Val Trp Asn Gly Val Glu Val Gly Gly 
620 625 630 

Val His He Gin Thr Gly He Asp Cys Glu Pro Leu Leu Leu Asn 
635 640 645 

Val Leu His Leu Asp Thr Ser Ser Asp Gly Glu Lys Cys Cys Gin 
650 655 660 

Val He Glu Ser Pro His Val Phe Pro Ala Asn Ala Glu Val Gly 
665 670 675 

Thr Val Leu Thr Ala Leu Ala He Pro Ala Gly Val He Phe He 
680 685 690 

Asn Ser Ala Gly Cys Pro Gly Gly Glu Gly Trp Thr Ala He Pro 
695 700 705 

Lys Glu Asp Met Arg Lys Thr Phe Arg Glu Gin Gly Leu Arg Asn 
710 715 720 

Gly Ser Ser He Leu He Gin Asp Ser His Asp Asp Asn Ser Leu 
725 730 735 

Leu Thr Lys Glu Glu Lys Trp Val Thr Ser Met Asn Glu He Asp 
740 745 750 

Trp Leu His Val Lys Asn Leu Cys Gin Leu Glu Ser Glu Glu Lys 
755 760 765 

Gin Val Lys He Ser Ala Thr Val Asn Thr Met Val Phe Asp He 
" 770 775 780 

Arg He Lys Ala He Lys Glu Leu Lys Leu Met Lys Glu Leu Ala 
785 790 • 795 

Asp Asn Ser Cys Leu Arg Pro He Asp Arg Asn Gly Lys Leu Leu 
BOO 805 810 

Cys Pro Val Pro Asp Ser Tyr Thr Leu Lys Glu Ala Glu Leu Lys 
815 820 825 

Met Gly Ser Ser Leu Gly Leu Cys Leu Gly Lys Ala Pro Ser Ser 
830 835 840 

Ser Gin Leu Phe Leu Phe Phe Ala Met Gly Ser Asp Val Gin Pro 
845 850 855 

Gly Thr Glu Met Glu He Val Val Glu Glu Thr He Ser Val Arg 
860 865 870 

Asp Cys Leu Lys Leu Met Leu Lys Lys Ser Gly Leu Gin Gly Asp 
875 880 885 

Ala Trp His Leu Arg Lys Met Asp Trp Cys Tyr Glu Ala Gly Glu 
890 895 900 

Pro Leu Cys Glu Glu Asp Ala Thr Leu Lys Glu Leu Leu He Cys 
905 910 915 

Ser Gly Asp Thr Leu Leu Leu He Glu Gly Gin Leu Pro Pro Leu 
920 925 930 

Gly Phe Leu Lys Val Pro He Trp Trp Tyr Gin Leu Gin Gly Pro 
935 940 . 945 

Ser Gly His Trp Glu Ser His Gin Asp Gin Thr Asn Cys Thr Ser 
950 955 960 

Ser Trp Gly Arg Val Trp Arg Ala Thr Ser Ser Gin Gly Ala Ser 
965 970 • 975 

Gly Asn Glu Pro Ala Gin Val Ser Leu Leu Tyr Leu Gly Asp He 
980 985 990 

Glu He Ser Glu Asp Ala Thr Leu Ala Glu Leu Lys Ser Gin Ala 
995 1000 1005 

Met Thr Leu Pro Pro Phe Leu Glu Phe Gly Val Pro Ser Pro Ala 
1010 1015 1020 

His Leu Arg Ala Trp Thr Val Glu Arg Lys Arg Pro Gly Arg Leu 
1025 1030 1035 
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Leu 


Arg 


Thr 


Asp Arg 
1040 


Gin 


Pro 


Leu 


Arg 


He 


Glu 


He Cys 
1055 


Leu 


Glu 


Pro 


Gly 


Pro 


Gin 


Asp Val 
1070 


Leu 


Leu 


Arg 


Glu 


Arg 


Thr 


Tyr Ala 
1085 


Pro 


Ala 


Leu 


Gin 


Gly 


Gly 


Thr Ala 
1100 


Gly 


Ser 


Leu 


Tyr 


Arg 


Leu 


Pro Val 
1115 


Glu 


Lys 


lie 


Glu 


Lys 


Phe 


Glu Trp 
1130 


Leu 


Pro 


He 


Thr 


Lys 


Arg 


Lys Lys 
1145 


Lys 


Lys 


Lys 


Pro 


Tyr 


Tyr 


Leu Lys 

1160 


Asp 


Gly 


Asp 


Leu 


He 


Asp 


Asp Asp 
1175 


Asp 


Asp 


Phe 


Gly 


Lys 


Glu 


Lys Gin 
1190 


Lys 


Gin 


Arg 


Gin 


Glu 


Ala 


Leu His 
1205 


Glu 


Gin 


Ser 


Glu 


Thr 


Pro 


Ala Arg 
1220 


Pro 


Arg 


Ala 


His 


Val 


Gly 


Ser Phe 
1235 


Arg 







<210> 7 

<211> 545 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 74973 67CD1 

<400> 7 



Met 


Leu 


Pro 


Gly 


Ala 


Trp 


Leu 


Leu 


1 








5 








Ala 


Arg 


Pro 


Ala 


Gin 
20 


Pro 


Cys 


Pro 


Gin 


Glu 


Val 


Phe 


Cys 
35 


Ser 


Asp 


Glu 


Asp 


He 


Pro 


Pro 


Tyr 
50 


Thr 


Lys 


Asn 


Phe 


Thr 


Thr 


Leu 


Glu 
65 


Thr 


Arg 


Ala 


Thr 


Lys 


Val 


Val 


Phe 
80 


Leu 


Asn 


Thr 


Asp 


Ala 


Phe 


Gly 


Gly 
95 


Leu 


Pro 


Arg 


Gly 


Ser 


Ser 


Phe 


Leu 

110 


Asn 


Leu 


Ser 


Thr 


Ser 


Leu 


Gly 


Lys 
125 


Leu 


Thr 


Leu 


Leu 


Pro 


Glu 


Gly 


Leu 
140 


Phe 


Gin 


His 


His 


Leu 


Gin 


Gly 


Asn 
155 


Gin 


Leu 


Gin 


Gin 


Pro 


Leu 


Thr 


His 
170 


Leu 


Lys 


Thr 



Arg Glu 


Tyr 


Lys Leu 


Gly Arg 


1045 






1050 


Leu Gin 


Lys 


Gly Glu 


Asn Leu 


1060 






1065 


Thr Gin 


Val 


Arg He 


Pro Gly 


1075 






1080 


Asp Leu 


Val 


Trp Asn 


Ala Ala 


1090 






1095 


Arg Glh 


Arg Val Ala 


Asp Phe 


1105 






1110 


Glu He 


Ala 


Lys Tyr 


Phe Pro 


1120 






1125 


Ser Ser 


Trp Asn Gin 


Gin He 


1135 






1140 


Gin Asp 


Tyr 


Leu Gin 


Gly Ala 


1150 






1155 


Thr He 


Gly Val Lys 


Asn Leu 


1165 






1170 


Ser Thr 


He 


Arg Asp 


Asp Thr 


1180 






* 1185 


Ala Leu 


Gly Arg Arg 


Lys Ser 


1195 






1200 


Ser Tyr 


He 


Leu Ser. 


Ser Ala 


1210 






1215 


Pro Glu 


Thr 


Ser Leu 


Ser He 


1225 






1230 


Trp Thr 


Ser 


Leu Leu 


Leu Leu 


10 






15 


Met Gly 


Cys 


Asp Cys 


Phe Val 


25 






30 


Glu Leu 


Ala 


Thr Val 


Pro Leu 


40 






45 


He He 


Phe 


Val Glu 


Thr Ser 


55 






60 


Phe Gly 


Ser 


Asn Pro 


Asn Leu 


70 






75 


Gin Leu 


Cys 


Gin Phe 


Arg Pro 


85 






90 


Leu Glu 


Asp 


Leu Glu 


Val Thr 


100 






105 


Thr Asn 


He 


Phe Ser 


Asn Leu 


115 






120 


Asn Phe 


Asn 


Met Leu 


Glu Ala 


130 






' 135 


Leu Ala 


Ala 


Leu Glu 


Ser Leu 


145 






150 


Ala Leu 


Pro 


Arg Arg 


Leu Phe 


160 






165 


Leu Asn 


Leu 


Ala Gin 


Asn Leu 


175 






180 
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Leu Ala Gin Leu Pro Glu Glu Leu 
185 

Gin Thr Leu Lys Leu Ser Asn Asn 
200 

Gly Val Phe Gly Lys Leu Gly Ser 
215 

Ser Asn Asn lie Ser Glu Leu Pro 
230 

Phe Cys Leu Glu Arg Leu Trp Leu 
245 

Leu Pro Leu Ser He Phe Ala Ser 

260 

Ser Leu Gin Trp Asn Met Leu Arg 
275 

Ala His Thr Pro Cys Leu Val Gly 
290 

Leu Glu Thr Val Ala Glu Gly Thr 

305 

Arg Ser Leu Met Leu Ser Tyr Asn 
320 

Gly He Phe Arg Asp Leu Glu Glu 
335 

Ser Asn Asn Leu Thr Ala Leu His 
350 

Ser Lys Leu Glu Leu Leu Ser Leu 

365 

Leu Pro Glu Gly He Phe Asp Thr 
380 

Ala Leu His Gly Asn Pro Trp Gin 
395 

Leu Phe Asn Trp Leu Gin Gin Tyr 
410 

Gin Thr Tyr Cys Ala Gly Pro Ala 
425 

Pro Ala Leu Asn Glu Lys Gin Leu 
440 

His Leu Gly Phe Gin Val Thr Trp 
455 

Gly Ser Trp Asp Leu Ala Val Gin 
470 ' 

Cys Thr Tyr Ser Asn Pro Glu Gly 
485 

Gin Ala Gin Cys Arg Trp Leu Asn 
500 

Gly Ser Leu Gly Leu Gin Tyr Asn 
515 

Arg Ser Ser Cys Gly Ser Leu Arg 
530 

Arg Ala Ala Gly Pro 
545 

.<210> 8 
<211> 414 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7632424CD1 

<400> 8 

Met Leu He Leu Thr Lys Thr Ala 
1 5 



Phe 


His 


Pro 


Leu 


Thr Ser 


Leu 




190 








195 


Ala 


Leu 


Ser Gly Leu Pro 


Gin 




205 








210 


Leu 


Gin 


Glu 


Leu 


Phe Leu Asp 




220 








225 


Pro 


Gin 


Val 


Phe 


Ser Gin 


Leu 




235 








240 


Gin Arg. Asn Ala 


He Thr 


His 




250 








255 


Leu Gly Asn Leu Thr Phe 


Leu 




265 








270 


Val 


Leu 


Pro 


Ala 


Gly Leu 


Phe 




280 








285 


Leu 


Ser 


Leu 


Thr 


His Asn 


Gin 




295 








300 


Phe 


Ala 


His 


Leu 


Ser Asn 


Leu 




310 








315 


Ala 


He 


Thr 


His 


Leu Pro 


Ala 




325 








330 


Leu 


Val 


Lys 


Leu 


Tyr Leu Gly 




340 








345 


Pro 


Ala 


Leu 


Phe 


Gin Asn 


Leu 




355 








360 


Ser 


Lys 


Asn 


Gin 


Leu Thr 


Thr 




370 








375 


Asn 


Tyr 


Asn 


Leu 


Phe Asn 


Leu 




385 








390 


Cys 


Asp 


Cys 


His 


Leu Ala 


Tyr 




400 








405 


Thr Asp Arg Leu 


Leu Asn 


He 




415 








420 


Tyr 


Leu 


Lys 


Gly Gin Val 


Val 




430 








435 


Val 


Cys 


Pro 


Val 


Thr Arg Asp 




445 








450 


Pro 


Asp 


Glu 


Ser 


Lys Ala 


Gly 




460 








465 


Glu 


Arg 


Ala 


Ala 


Arg Ser 


Gin 




475 








480 


Thr 


Val 


Val 


Leu 


Ala Cys 


Asp 




490 








495 


Val 


Gin 


Leu 


Ser 


Pro Arg 


Gin 




505 








510 


Ala 


Ser 


Gin Glu Trp Asp Leu 




520 








525 


Leu 


Thr 


Val 


Ser 


He Glu 


Ala 




535 








540 



Gly Val Phe Phe Lys Pro Ser 
10 15 



16/59 



wo 03/000844 



PCT/US02/19360 



Lys 


Arg 


Lys Val 


Tyr 








20 


Gly 


Thr 


Leu Phe 


Leu 








35 


Cys 


Asp 


Asp Thr 


Ala 








50 


Leu 


Gly 


Glu Ala 


He 








65 


Gly 


Gly 


He Val 


Pro* 








80 


lie 


Gin 


Arg He 


Val 








95 


Pro 


Ser 


Asp Leu 


Ser 








110 


Ala 


Leu 


Ser Leu 


Gly 








125 


Gly 


Gin 


Leu Lys 


Lys 








140 


His 


Ala 


Leu Thr 


He 








155 


Leu 


Val 


Leu Leu 


He 








170 


Gin 


Gly 


Val Ser 


Asp 








185 


Ala 


Pro 


Gly Asp 


Met 








200 


He 


Lys 


His Pro 


Glu 








215 


Glu 


His 


Leu Ala 


Lys 








230 


Pro 


Pro 


Leu His 


His 








245 


Leu 


Gin 


His Val 


Thr 








260 


Glu 


Gly 


He Glu 


Lys 








275 


Ala 


Ala 


Thr Val 


Gin 








290 


Thr 


His 


Arg Ala 


He 








305 


Gin 


Asn 


Asn Ala 


Val 








320 


Phe 


Tyr 


He Arg 


Arg 








335 


Cys 


Thr 


Leu Leu 


Cys 








350 


He 


Met 


He Ala 


Trp 








365 


Gly 


He 


Leu His 


Asp 








380 


Pro 


Leu 


Gly Val 


Asp 








395 


Lys 


Val 


Pro Gin 


Leu 








410 



<210> 9 

<211> 611 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc„feature 
<223> Incyte ID No: 



Glu 


Phe 


Leu 


Arg 


Ser 
25 


His 


Lys 


He 


Val 


Leu 
40 


Ala 


Ala 


Val 


Val 


Asp 

55 


His 


Ser 


Gin 


Thr 


Glu 
70 


Pro 


Ala 


Ala 


Gin 


Gin 
85 


Gin 


Glu 


Ala 


Leu 


Ser 
100 


Ala 


He 


Ala 


Thr 


Thr 
115 


Val 


Gly 


Leu 


Ser 


Phe 
130 


Pro 


Phe 


He 


Pro 


He 
145 


Arg 


Leu 


Thr 


Asn 


Lys 
160 


Ser 


Gly 


Gly 


His 


Cys 
175 


Phe 


Leu 


Leu 


Leu 


Gly 
190 


Leu 


Asp 


Lys 


Val 


Ala 
205 


Cys 


Ser 


Thr 


Met 


Ser 
220 


Gin 


Gly 


Asn 


Arg 


Phe 
235 


Ala 


Lys 


Asn 


Cys 


Asp 
250 


Asp 


Lys 


He 


He 


Met 

265 


Gly 


Gin 


He 


Leu 


Ser 
280 


His 


Thr 


Met 


Ala 


Cys 
295 


Leu 


Phe 


Cys 


Lys 


Gin 
310 


Leu 


Val 


Ala 


Ser 


Gly 
325 


Ala 


Leu 


Glu 


He 


Leu 
340 


Pro 


Pro 


Pro 


Arg 


Leu 
355 


Asn 


Gly 


He 


Glu 


Arg 

370 


He 


Glu 


Gly 


He 


Arg 
385 


He 


Ser 


Lys 


Glu 


Val 
400 


Lys 


Met 


Glu 


He 





1804436CD1 



Phe 


Asn 


Phe 


His 


Pro 
30 


Gly 


He 


Glu 


Thr 


Ser 
45 


Glu 


Thr 


Gly 


Asn 


Val 
60 


Val 


His 


Leu 


Lys 


Thr 
75 


Leu 


His 


Arg 


Glu 


Asn 
90 


Ala 


Ser 


Gly 


Val 


Ser 
105 


He 


Lys 


Pro 


Gly 


Leu 
120 


Ser 


Leu 


Gin 


Leu 


Val 
135 


His 


His 


Met 


Glu 


Ala 
150 


Val 


Glu 


Phe 


Pro 


Phe 

165 


Leu 


Leu 


Ala 


Leu 


Val 
180 


Lys 


Ser 


Leu 


Asp 


He 
195 


Arg 


Arg 


Leu 


Ser 


Leu 
210 


Gly 


Gly 


Lys 


Ala 


He 
225 


His 


Phe 


Asp 


He 


Lys 
240 


Phe 


Ser 


Phe 


Thr 


Gly 
255 


Lys 


Lys 


Glu 


Lys 


Glu 

270 


Ser 


Ala 


Ala 


Asp 


He 
285 


His 


Leu 


Val 


Lys 


Arg 
300 


Arg 


Asp 


Leu 


Leu 


Pro 
315 


Gly 


Val 


Ala 


Ser 


Asn 
330 


Thr 


Asn 


Ala 


Thr 


Gin 
345 


Cys 


Thr 


Asp 


Asn 


Gly 
360 


Leu 


Arg 


Ala 


Gly 


Leu 

375 


Tyr 


Glu 


Pro 


Lys 


Cys 
390 


Gly 


Glu 


Ala 


Ser 


He 
405 
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<400> 9 

Met Ser Cys Lys Lys Gin Arg Ser Arg Lys His Ser Val Asn Glu 
1 5 10 15 

Lys Cys Asn Met Lys lie Glu His Tyr Phe Ser Pro Val Ser Lys 
20 25 30 

Glu Gin Gin Asn Asn Cys Ser Thr Ser Leu Met Arg Met Glu Ser 
35 40 45 

Arg Gly Asp Pro Arg Ala Thr Thr Asn Thr Gin Ala Gin Arg Phe 
50 55 60 

His Ser Pro Lys Lys Asn Pro Glu Asp Gin Thr Met Pro Gin Asn 
65 70 75 

Arg Thr lie Tyr Val Thr Leu Lys Val Asn His Arg Arg Asn Gin 
80 85 90 

Asp Met Lys Leu Lys Leu Thr His Ser Glu Asn Ser Ser Leu Tyr 
95 100 105 

Met Ala Leu Asn Thr Leu Gin Ala Val Arg Lys Glu lie Glu Thr 
110 115 120 

His Gin Gly Gin Glu Met Leu Val Arg Gly Thr Glu Gly lie Lys 
125 130 135 

Glu Tyr lie Asn Leu Gly Met Pro Leu Ser Cys Phe Pro Glu Gly 
. 140 145 150 

Gly Gin Val Val lie Thr Phe Ser Gin Ser Lys Ser Lys Gin Lys 
155 160 165 

Glu Asp Asn His lie Phe Gly Arg Gin Asp Lys Ala Ser Thr Glu 
170 175 180 

Cys Val Lys Phe Tyr lie His Ala He Gly He Gly Lys Cys Lys 
185 190 195 

Arg Arg He Val Lys Cys Gly Lys Leu His Lys Lys Gly Arg Lys 
200 205 210 

Leu Cys Val Tyr Ala Phe Lys Gly Glu Thr He Lys Asp Ala Leu 
215 220 225 

Cys Lys Asp Gly Arg Phe Leu Ser Phe Leu Glu Asn Asp Asp Trp 
230 235 .240 

Lys Leu He Glu Asn Asn Asp Thr He Leu Glu Ser Thr Gin Pro 
245 250. 255 

Val Asp Glu Leu Glu Gly Arg Tyr Phe Gin Val Glu Val Glu Lys 
260 265 270 

Arg Met Val Pro Ser Ala Ala Ala Ser Gin Asn Pro Glu Ser Glu 
275 280 285 

Lys Arg Asn Thr Cys Val Leu Arg Glu Gin He Val Ala Gin Tyr 
290 295 300 

Pro Ser Leu Lys Arg Glu Ser Glu Lys He He Glu Asn Phe Lys 
305 310 315 

Lys Lys Met Lys Val Lys Asn Gly Glu Thr Leu Phe Glu Leu His 
320 325 330 

Arg Thr Thr Phe Gly Lys Val Thr Lys Asn Ser Ser Ser He. Lys 
335 . 340 345 

Val Val Lys Leu Leu Val Arg Leu Ser Asp Ser Val Gly Tyr Leu 
350 355 360 

Phe Trp Asp Ser Ala Thr Thr Gly Tyr Ala Thr Cys Phe Val Phe 
365 370 375 

Lys Gly Leu Phe He Leu Thr Cys Arg His Val He Asp Ser He 
380 385 390 

Val Gly Asp Gly He Glu Pro Ser Lys Trp Ala Thr He He Gly 
395 400 405 

Gin Cys Val Arg Val Thr Phe Gly Tyr Glu Glu Leu Lys Asp Lys 
410 415 420 

Glu Thr Asn Tyr Phe Phe Val Glu Pro Trp Phe Glu He His Asn 
425 430 435 

Glu Glu Leu Asp Tyr Ala Val Leu Lys Leu Lys Glu Asn Gly Gin 
440 ' 445 450 

Gin Val Pro Met Glu Leu Tyr Asn Gly He Thr Pro Val Pro Leu 
455 460 465 
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Ser 


Gly 


Leu He 


His He He Gly 








470 


Gin 


He 


Asp Ala 


Cys Ala Val He 








485 


Lys 


Cys 


Gin Glu 


Arg Val Gin Ser 








500 


Tyr 


Val 


His Met 


Tyr Thr Gin Arg 








515 


Asn 


Pro 


Asp Val 


He Thr Tyr Asp 








530 


Ser 


Gly 


Ser Pro 


Val Phe Asp Ser 








545 


His 


Ala 


Ala Gly 


Phe Ala Tyr Thr 










He 


lie 


Glu Phe 


Gly Ser Thr Met 








575 


Lys 


Gin 


Arg His 


Lys Pro Trp Tyr 








590 


Gin 


Asp 


Val Glu 


Met Met Ser Asp 



605 

<210> 10 
<211> 147 
<212> PRT 

<213> Homo sapiens 



His 


Pro 


Tyr Gly Glu Lys Lys 




475 


480 


Pro 


Gin 


Gly Gin Arg Ala Lys 




A an 
490 


495 


uys 


Lys 


Ala Glu Ser Pro Glu 




505 


510 


Ser 


Phe 


Gin Lys He Val His 




520 


525 


Thr 


Glu 


Phe Phe Phe Gly Ala 




535 


540 


Lys Gly 


Ser Leu Val Ala Met 




550 


555 


Tyr 


Gin 


Asn Glu Thr Arg Ser 




565 


570 


Glu 


Ser 


He Leu Leu Asp He 




580 


585 


Glu 


Glu 


Val Phe Val Asn Gin 




595 


600 


Glu Asp 


Leu 




6.10 





<220> 

<221> misc^f eature 

<223> Incyte ID No: 7486358CD1 



<400> 10 




Met 


Trp 


Ser 


Leu Pro Pro Ser 


1 






5 


Leu 


Leu 


Leu 


Phe Ser Phe Gin 








20 


Phe 


Gin 


Glu 


Glu Glu Glu Trp 








35 


Tyr 


Leu 


Pro 


Pro Thr Leu Glu 








50 


Gin 


Ser 


Lys 


Asp Trp Tyr Ala 








65 


Ser 


Trp 


Lys 


Glu Gin Gly Tyr 








80 


Leu 


Gin 


Leu 


Gly Arg Thr Met 








95 


Asp 


Asn 


Cys 


Pro Phe Gin Glu 








110 


Thr 


Cys 


Phe 


Phe Thr He Gly 








125 


Asp 


Leu 


Trp 


Asn Lys Thr Cys 








140 



Arg Ala Leu 


Ser Cys 


Ala Pro Leu 


10 




15 


Phe Leu Val 


Thr Tyr Ala Trp Arg 


25 




30 


Asn Asp Gin 


Lys Gin 


He Ala Val 


40 




45 


Phe Ala Val 


Tyr Thr 


Phe Asn Lys 


55 




60 


Tyr Lys Leu 


Val Pro 


Val Leu Ala 


70 




75 


Asp Lys Met 


Thr Phe 


Ser Met Asn 


85 




90 


Cys Gly Lys 


Phe Glu 


Asp Asp He 


100 




105 


Ser Pro Glu 


Leu Asn 


Asn Thr Cys 


115 




120 


He Glu Pro 


Trp Arg Thr Arg Phe 


130 




135 


Ser Gly Gly 


His Ser 




145 







<210> 11 

<211> 624 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature ■ 

<223> Incyte ID No: 7472344CD1 

<400> 11 

Met Ala Leu Arg Ala Arg Ala Leu Tyr Asp Phe Arg Ser Glu Asn 



19/59 



wo 03/000844 



PCT/US02/19360 



1 .5 10 15 

Pro Gly Glu lie Ser Leu Arg Glu His Glu Val Leu Ser Leu Cys 
20 25 30 

Ser Glu Gin Asp lie Glu Gly Trp Leu Glu Gly Val Asn Ser Arg 
35 40 45 

Gly Asp Arg Gly Leu Phe Pro Ala Ser Tyr Val Gin Val lie Arg 
50 55 60 

Ala Pro Glu Pro Gly Pro Ala Gly Asp Gly Gly Pro Gly Ala Pro 
65 . 70 . 75 

Ala Arg Tyr Ala Asn Val Pro Pro Gly Gly Phe Glu Pro Leu Pro 
80 85 90 

Val Ala Pro Pro Ala Ser Phe Lys Pro Pro Pro Asp Ala Phe Gin 
95 100 105 

Ala Leu Leu Gin Pro Gin Gin Ala Pro Pro Pro Ser Thr Phe Gin 
110 115 120 

Pro Pro Gly Ala Gly Phe Pro Tyr Gly Gly Gly Ala Leu Gin Pro 
125. 130 135 

Ser Pro Gin. Gin Leu Tyr Gly Gly Tyr Gin Ala Ser Gin Gly Ser 
140 145 150 

Asp Asp Asp" Trp Asp Asp Glu Trp Asp Asp Ser Ser Thr Val Ala 
155 160 165 

Asp Glu Pro Gly Ala Leu Gly Ser Gly Ala Tyr Pro Asp Leu Asp 
170 175 180 

Gly Ser Ser Ser Ala Gly Val Gly Ala Ala Gly Arg Tyr Arg Leu 
185 190 195 

Ser Thr Arg Ser Asp Leu Ser Leu Gly Ser Arg Gly Gly Ser Val 
200 205 210 

Pro Pro Gin His His Pro Ser Gly Pro Lys Ser Ser Ala Thr Val 
215 220 225 

Ser Arg Asn Leu Asn Arg Phe Ser Thr Phe Val Lys Ser Gly Gly 
230 235 240 

Glu Ala Phe Val Leu Gly Glu Ala Ser Gly Phe Val Lys Asp Gly 
245 250 ^ 255 

Asp Lys Leu Cys Val Val Leu Gly Pro Tyr Gly Pro Glu "Trp Gin 
260 265 270 

Glu Asn Pro Tyr Pro Phe Gin Cys Thr lie Asp Asp Pro Thr Lys 
275 280 285 

Gin Thr Lys Phe Lys Gly Met Lys Ser Tyr lie Ser Tyr Lys Leu 
290 295 300 

Val Pro Thr His Thr Gin Val Pro Val His Arg Arg Tyr Lys His 
• 305 310 315 

Phe Asp Trp Leu Tyr Ala Arg Leu Ala Glu Lys Phe Pro Val lie 
320 325 330 

Ser Val Pro His Leu Pro Glu Lys Gin Ala Thr Gly Arg Phe Glu 
335 340 345 

Glu Asp Phe lie Ser Lys Arg Arg Lys Gly Leu lie Trp Trp Met 
350 355 360 

Asn His Met Ala Ser His Pro Val Leu Ala Gin Cys Asp Val Phe 
365 370 375 

Gin His Phe Leu Thr Cys Pro Ser Ser Thr Asp Glu Lys Ala Trp 
380 385 390 

Lys Gin Gly Lys Arg Lys Ala Glu Lys Asp Glu Met Val Gly Ala 
395 400 405 

Asn Phe Phe Leu Thr Leu Ser Thr Pro Pro Ala Ala Ala Leu Asp 
410 415 420 

Leu Gin Glu Val Glu Ser Lys lie Asp Gly Phe Lys Cys Phe Thr 
425 430 435 

Lys Lys Met Asp Asp Ser Ala Leu Gin Leu Asn His Thr Ala Asn 
440 445 450 

Glu Phe Ala Arg Lys Gin Val Thr Gly Phe Lys Lys Glu Tyr Gin 
455 460 465 

Lys Val Gly Gin Ser Phe Arg Gly Leu Ser Gin Ala Phe Glu Leu 
470 475 480 



20/59 



wo 03/000844 



Asp 




vjxn Axa 


fne 
485 


Ser 


vax 


Gly Leu 


Till" 




TV gi »^ TV n s 

Asp AX a 


Tyr" 
500 


Asp 


TV T = 

AX a 


xxe CiXy 




Arcf 




Leu 

515 


Asp 


Piro 


vax iMex- 








iixa 
530 


Asn 


File 


Pro Asp 






T en T T^Vi ■V- 
JLtSU. ifli 


Lys 
545 


va± 


Lys 


Vjiu ber 




Lys 




vax 
560 


LrXn 


Lys 


Ala Asp 


Asn 


Tli3r 


Xle S©3r 


r*ne 
575 


AX a 




Leu Ala 




Tl A 

x±e 


TV ^r-sl 

A3rg vaj. 


Arg 
590 


TV MM 

Asp 


Pne 


Lys Ser 


Gin 


Gin 


Gin lie 


lie 
605 


Phe 


Phe 


Gin Lys 


Glu 


Ala 


Leu His 


Lys 
620 


Tyr 


Asp 


Ser Val 



<210> 12 
<211> 283 
<212> PRT 
. <213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7192959CD1 

<400> 12 



Met 


Gly 


Ala 


Ser 


Val 


Ser 


Arg 


Gly Arg 


1 








5 










Pro 


Glu 


Pro 


Glu 


Pro 
20 


Glu 


Glu 


Ala 


Leu 


Pro 


Glu 


Leu 


Leu 


Leu 
35 


Val 


Val 


Leu 


Ser 


Leu 


Leu 


Gly 


Arg 


Cys 
50 


Arg 


Gin 


Val 


Cys 


Val 


Asp 


Gly 


Gin 


Ala 
65 


Leu 


Trp 


Leu 


Leu 


Gly 


Ala 


Thr 


Gly 


Arg 
80 


Ala 


Leu 


Leu 


His 


Ser 


Pro 


Ala 


Arg 


Asn 
95 


Ala 


Arg 


Pro 


Cys 


Ala 


Arg 


Arg 


Pro 


He 
110 


Gly 


Arg 


Asn 


Leu 


Gin 


Glu 


Gly 


Leu 


Arg 
125 


Lys 


Trp 


Met 


Val 


Trp 


Val 


Val 


Glu 


Glu. 
140 


Asn 


Arg 


Thr 


Thr 


Gin 


Thr 


Cys 


Phe 


Val 
155 


Thr 


Ser 


Phe 


Ser 


Val 


Leu 


Asp 


Leu 


Glu 


Glu 


Glu 


Gly Leu 










170 










Ser 


Gly 


Arg 


lie 


Glu 

185 


He 


Cys 


Val 


Ser 


His 


Asp 


Ser 


Gly 


Cys 
200 


Met 


Tyr 


Arg 


Leu 


Ala 


Asn 


Gin 


Thr 


Val 
215 


Leu 


Asp 


Lys 


Phe 


lie 


Pro 


Gin 


Trp 


Asn 


Asn 


Asn 


Ala 


Cys 



230 



PCT/US02/19360 



Asn 


uxn i\xa xxe 


TV 1 

AXa 


Phe 


/ion 






>1 Q 


CjfXU 


Leu Pne Ala 


OjXU 


CjXn 


505 








Asp 


Leu Leu Ala 


Leu 


Tyr 


520 






525 


He 


He His Val 


Gin 


Lys 


535 






540 


Arg Arg His Val 


Glu 


Glu 


550 






555 


Gly He Gin Asp Arg Cys 


565 






570 


Glu 


He His His 


Phe 


His 


580 






585 


Gin 


Met Gin His 


Phe 


Leu 


595 






600 


Val 


Thr Gin Lys 


Leu 


Glu 


610 






615 



Ala Ala Arg Val Pro 


Ala 


10 




15 


Asp 


Leu Ser Gin Leu 


Pro 


25 




30 


His 


Val Pro Pro Arg Thr 


40 




45 


Arg 


Gly Trp Arg Ala 


Leu 


55 




60 


He Leu Ala Arg Asp His 


70 




75 


Leu 


Ala Arg Ser Cys 


Gin 


85 




90 


Pro 


Leu Gly Arg Phe 


Cys 


100 




105 


He 


Arg Asn Pro Cys 


Gly 


115 




120 


Gin His Gly Gly Asp Gly 


130 




135 


Val 


Pro Gly Ala Pro 


Ser 


145 




150 


Trp 


Cys Cys Lys Lys 


Gin 


160 




165 


Trp 


Pro Glu Leu Leu 


Asp 


175 




180 


Asp 


Trp Trp Gly Ala Arg 


190 




195 


Leu 


Val Gin Leu Leu 


Asp 


205 




210 


Ser 


Ala Val Pro Asp 


Pro 


220 




225 


Leu 


His Val Thr His 


Val 


235 




240 



2.1/59 



wo 03/000844 



Phe Ser Asn He Lys Met Gly Val Arg Phe Val Ser Phe Glu His 

245 250 255 

Arg Gly Gin Asp Thr Gin Phe Trp Ala Gly His Tyr Gly Ala Arg 

260 265 270 

Val Thr Asn Ser Ser Val He Val Arg Val Arg Leu Ser 

275 280 

<210> 13 

<211> 142 

<212> PRT 

<213> Homo sapiens 

<220> - 

<221> misc_feature 

<223> Incyte ID No: 6169565CD1 

<400> 13 



Met Ala 


Asn 


Gly 


Lys 


Lys 


Phe 


Leu 


His Leu 


Pro 


Leu 


Leu 


Thr 


Ser 


1 






5 








10 










15 


Phe Pro 


Leu 


Pro 


Arg 
20 


Leu 


Phe 


Pro 


Thr Pro 
25 


Leu 


He 


Cys 


Ser 


Leu 
30 


He Ser 


Val 


Val 


Gly 

35 


He 


Lys 


Gly 


He Gin 
40 


Lys 


Thr 


Pro 


Leu 


Gin 

45 


Thr Leu 


Pro 


Leu 


Tyr Cys 


Ser 


Phe Arg Asp Val 


Thr 


Leu 


He 


His 








50 








55 










60 


Cys' Phe 


Leu 


Leu 


He 
65 


Pro 


His 


Cys 


Pro Met 
70 


Pro 


Leu 


Leu 


Ser 


Arg 
75 


Asp Leu 


Leu 


His 


Lys 


Leu 


Arg 


Gly Phe Leu His 


Leu 


Trp 


Ala 


Leu 








80 








85 










90 


Gly Gin 


Ser 


His 


Pro Tyr 


Leu 


Phe 


Leu Cys 


Gin 


Glu 


Pro 


Lys 


Phe 








95 








100 










105 


Ser Leu 


Pro 


Glu 


Val 
110 


Lys 


Glu 


Pro 


Thr Pro 

115 


Asp 


Leu 


Ser 


He 


He 
120 


Thr Gin 


Thr 


Asn 


Pro 
125 


He 


Val 


Trp 


Ser Thr 
130 


Gin 


He 


Leu 


Gin 


Ser 
135 


Trp Arg 


Pro 


Thr 


Thr 
140 


Pro 


His 

















<210> 14 
<211> 354 , 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7494717CD1 

<400> 14 

Met Ala Glu Ser Pro Thr Glu Glu Ala Ala Thr Ala Gly Ala Gly 
1 5 10 . 15 

Ala Ala Gly Pro Gly Ala Ser Ser Val Ala Gly Val Val Gly Val 

20 25 30 

Ser Gly Ser Gly Gly Gly Phe Gly Pro Pro Phe Leu Pro Asp Val 

35 40 45 

Trp Ala Ala Ala Ala Ala Ala Gly Gly Ala Gly Gly Pro Gly Ser 

50 55 60 

Gly Leu Ala Pro Leu Pro Gly Leu Pro Pro Ser Ala Ala Ala His 

65 70 75 

Gly Ala Ala Leu Leu Ser His Trp. Asp Pro Thr Leu Ser Ser Asp 

80 85 90 

Trp Asp Gly Glu Arg Thr Ala Pro Gin Cys Leu Leu Arg He Lys 

95 100 105 

Arg Asp He Met Ser He Tyr Lys Glu Pro Pro Pro Gly Met Phe 
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wo 03/000844 

110 



Val 


Val 


Pro Asp 


Thr 
125 


Val 


Asp 


Met Thr 


Thr 


Gly 


Pro Phe 


Asp 
140 


Thr 


Pro 


Tyr Glu 


Val 


Phe 


Arg Cys 


Pro 
155 


Pro 


Asp 


Tyr Pro 


Lys 


Leu 


Met Thr 


Thr 
170 


Gly 


Asn 


Asn Thr 


Phe 


Tyr 


Arg Asn 


Gly 
185 


Lys 


Val 


Cys Leu 


Thr 


Gly 


Pro Ala 


Trp 
200 


Ser 


Pro 


Ala Gin 


He 


Ser 


He Gin 


Ser 
215 


Leu 


Met 


Thr Glu 


Pro 


Gly 


Phe Glu 


Gin 
230 


Glu 


Arg 


His Pro 


Asn 


Glu 


Cys He 


Arg 
245 


His 


Glu 


Thr He 


Met 


Met 


Glu Gly 


Lys 
260 


Cys 


Pro 


Cys Pro 


Met 


Glu 


Lys Ser 


Phe 
275 


Leu 


Glu 


Tyr Tyr 


Cys 


Lys 


Asp Arg 


Leu 

290 


His 


Leu 


Gin Gly 


Pile 


Gly 


Glu Lys 


Arg 
305 


Gly 


His 


Phe Asp 


Arg 


Leu 


Gly Leu 


He 
320 


Arg 


Gin 


Lys Val 


Glu 


Asn 


Ala Glu 


Met 
335 


Asp 


Ser 


Asp Ser 


Thr 


Asp 


Leu His 


Gly 


Ser 


Leu 


Arg Val 



350 



PCT/US02/19360 



115 


120 


Lys He 


His Ala Leu He 


130 


135 


Gly Gly Phe Phe Leu Phe 


145 


150 


He His 


Pro Pro Arg Val 


160 


165 


Val Arg 


Phe Asn Pro Asn 


175 


180 


Ser He Leu Gly Thr Trp 


190 


195 


Ser He 


Ser Ser Val Leu 


205 


210 


Asn Pro 


Tyr His Asn Glu 


220 


225 


Gly Asp 


Ser Lys Asn Tyr 


235 


240 


Arg Val 


Ala Val Cys Asp 


250 


255 


Glu Pro Leu Arg Gly Val 


265 


270 


Asp Phe 


Tyr Glu Val Ala 


280 


285 


Gin Thr 


Met Gin Asp Pro 


295 


300 


Tyr Gin 


Ser Leu Leu Met 


310 


315 


Leu Glu 


Arg Leu His Asn 


325 


330 


Ser Ser 


Ser Gly Thr Glu 


340 


345 



<210> 15 
<211> 89 
<212> PRT 
<213> Homo 



sapiens 



<220> 

<221> xnisc_feature 

<223> Incyte ID No: 7497510CD1 

<400> 15 



Met 


Gin 


Leu Gin 


Ala 


Ser 


Leu 


Ser Phe 


Leu Leu 


He 


Leu 


Thr 


Leu 


1 






5 








10 








15 


Cys 


Leu 


Glu Leu 


Arg 


Ser 


Glu 


Leu Ala 


Arg Asp 


Thr 


He 


Lys 


Asp 








20 








25 








30 


Leu 


Leu 


Pro Asn 


Val 


Cys 


Ala 


Phe Pro 


Met Glu 


Lys 


Gly 


Pro 


Cys 








35 








40 








45 


Gin 


Thr 


Tyr Met 


Thr 


Arg 


Trp 


Phe Phe 


Asn Phe 


Glu 


Thr 


Gly Glu 








50 








55 








60 


Cys 


Glu 


Leu Phe 


Ala 


Tyr Gly Gly Cys 


Gly Gly Asn 


Ser 


Asn 


Asn 








65 








70 








75 


Phe 


Leu Arg Lys 


Glu 


Lys 


Cys 


Glu Lys 


Phe Cys 


Lys 


Phe 


Thr 










80 








85 











<210> 16 
<211> 419 
<212> PRT 

<213> Homo sapiens 



<220> 



23/59 



wo 03/000844 



<221> misc_feature 

<223> Incyte ID No: 7498882CD1 

<400> 16 



Met 


Gly Ala Gly 


Pro 


Ser 


Leu 


Leu 


Leu 


Ala 


Ala 


Leu 


Leu 


Leu 


Leu 


1 




5 










10 










15 


Leu 


Ser Gly Asp 


Gly 
20 


Ala 


Val 


Arg 


Cys 


Asp 
25 


Thr 


Pro 


Ala 


Asn 


Cys 
30 


Thr 


Tyr Leu Asp 


Leu 
35 


Leu 


Gly 


Thr 


Trp 


Val 
40 


Phe 


Gin 


Val 


Gly 


Ser 
45 


Ser 


Gly Ser Gin 


Arg 
50 


Asp 


Val 


Asn 


Cys 


Ser 
55 


Val 


Met 


Gly 


Pro 


Gin 
60 


Glu 


Lys Lys Val 


Val 
65 


Val 


Tyr 


Leu 


Gin 


Lys 
70 


Leu 


Asp 


Thr 


Ala 


Tyr 
75 


Asp 


Asp Leu Gly 


Asn 
80 


Ser 


Gly 


His 


Phe 


Thr 
85 


He 


He 


Tyr 


Asn 


Gin 
90 


Gly 


Phe Glu lie 


Val 

95 


Leu 


Asn 


Asp 


Tyr 


Lys 
100 


Trp 


Phe 


Ala 


Phe 


Phe. 

105 


Lys 


Tyr Lys Glu 


Glu 
110 


Gly 


Ser 


Lys 


Val 


Thr 
115 


Thr 


Tyr 


Cys 


Asn 


Glu 
120 


Thr 


Met Thr Gly 


Trp 
125 


Val 


His 


Asp 


Val 


Leu 
130 


Gly 


Arg 


Asn 


Trp 


Ala 
135 


Cys 


Phe Thr Gly 


Lys 
140 


Lys 


Val 


Gly 


Thr 


Ala 
145 


Ser 


Glu 


Asn 


Val 


Tyr 
150 


-Val 


Asn Thr Ala 


His 
155 


Leu 


Lys 


Asn 


Ser 


Gin 
160 


Glu 


Lys 


Tyr 


Ser 


Asn 
165 


Arg 


Leu Tyr Lys 


Tyr 
170 


Asp 


His 


Asn 


Phe 


Val 
175 


Lys 


Ala 


He 


Asn 


Ala 
180 


lie 


Gin Lys Ser 


Trp 
185 


Thr 


Ala 


Thr 


Thr 


Tyr 
190 


Met 


Glu 


Tyr 


Glu 


Thr 
195 


Leu 


Thr Leu Gly 


Asp 

200 


Met 


He 


Arg 


Arg 


Ser 

205 


Gly 


Gly 


His 


Ser 


Arg 

210 


Lys 


lie Pro Arg 


Pro 

215 


Lys 


Pro 


Ala 


Pro 


Leu 
220 


Thr 


Ala 


Glu 


He 


Gin 
225 


Gin 


Lys lie Leu 


His 
230 


Leu 


Pro 


Thr 


Ser 


Trp 
235 


Asp 


Trp 


Arg 


Asn 


Val 
240 


His 


Gly He Asn 


Phe 
245 


Val 


Ser 


Pro 


Val 


Arg 
250 


Asn 


Gin 


Gly 


Cys 


Glu 
255 


Gly 


Gly Phe Pro 


Tyr 
260 


Leu 


He 


Ala 


Gly 


Lys 
265 


Tyr 


Ala 


Gin 


Asp 


Phe 
270 


Gly 


Leu Val Glu 


Glu 
275 


Ala 


Cys 


Phe 


Pro 


Tyr 
280 


Thr 


Gly 


Thr 


Asp 


Ser 
285 


Pro 


Cys Lys Met 


Lys 
290 


Glu 


Asp 


Cys 


Phe 


Arg 
295 


Tyr 


Tyr 


Ser 


Ser 


Glu 
300 


Tyr 


His Tyr Val 


Gly 
305 


Gly 


Phe 


Tyr 


Gly 


Gly 
310 


Cys 


Asn 


Glu 


Ala 


Leu 
315 


Met 


Lys Leu Glu 


Leu 

320 


Val 


His 


His 


Gly 


Pro 

325 


Met 


Ala 


Val 


Ala 


Phe 

330 


Glu 


Val Tyr Asp 


Asp 
335 


Phe 


Leu 


His 


Tyr 


Lys 
340 


Lys 


Gly 


He 


Tyr 


His 
345 


His 


Thr Gly Leu 


Arg 
350 


Asp 


Pro 


Phe 


Asn 


Pro 
355 


Phe 


Glu 


Leu 


Thr 


Asn 
360 


His 


Ala Val Leu 


Leu 
365 


Val 


Gly 


Tyr 


Gly 


Thr 
370 


Asp 


Ser 


Ala 


Ser 


Gly 
375 


Met 


Asp Tyr Trp 


He 
380 


Val 


Lys 


Asn 


Ser 


Trp 
385 


Gly 


Thr 


Gly 


Trp 


Gly 
390 


Glu 


Asn Gly Tyr 


Phe 
395 


Arg 


He 


Arg 


Arg 


Gly 
400 


Thr 


Asp 


Glu 


Cys 


Ala 
405 


lie 


iGlu Ser He 


Ala 
410 


Val 


Ala 


Ala 


Thr 


Pro 
415 


He 


Pro 


Lys 


Leu 





<210> 17 
<211> 951 



24/59 



wo 03/000844 



PCT/US02/19360 



<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature' 

<223> Incyte ID No: 5524205CD1 

<400> 17 



Met 


Asp 


His 


Gin 


Asn 


Leu 


Ser 


Glu His 


Val Leu 


Cys 


Met 


Val 


Leu 


1 








5 








10 








15 


Tyr 


Leu 


He 


Glu 


Leu 


Gly 


Leu 


Glu Asn 


Ser Ala 


Glu 


Glu 


Glu 


Ser 










20 








25 








30 


Asp 


Glu 


Glu 


Ala 


Ser 


Val 


Gly 


Gly Pro Glu Arg Cys His Asp Ser 










35 








40 








45 


Trp 


Phe 


Pro 


Gly 


Ser 


Asn 


Leu 


Val Ser 


Asn Met 


Arg 


His 


Phe 


He 










50 








55 








60 


Asn 


Tyr 


Val 


Arg 


Val 


Arg 


Val 


Pro Glu 


Thr Ala 


Pro 


Glu 


Val 


Lys 










65 








70 








75 


Arg 


Asp 


Ser 


Pro 


Ala 


Ser 


Thr 


Ser Ser 


Asp Asn 


Leu 


Gly 


Ser 


Leu 










80 








85 








90 


Gin 


Asn 


Ser 


Gly 


Thr 


Ala 


Gin 


Val Phe 


Ser Leu 


Val 


Ala 


Glu 


Arg 










95 








100 








105 


Arg 


Lys 


Lys 


Phe 


Gin 


Glu 


He 


He Asn Arg Ser 


Ser 


Ser 


Glu 


Ala 










110 








115 








120 


Asn 


Gin 


Val 


Val 


Arg 


Pro 


Lys 


Thr Ser 


Ser Lys 


Trp 


Ser 


Ala 


Pro 










125 








130 








135 


Gly 


Ser 


Ala 


Pro 


Gin 


Leu 


Thr 


Thr Ala 


He Leu 


Glu 


He 


Lys 


Glu 










140 








145 








150 


Ser 


He 


Leu 


Ser 


Leu 


Leu 


He 


Lys Leu 


His His 


Lys 


Leu 


Ser Gly 










155 








1.60 






165 


Lys 


Gin 


Asn 


Ser 


Tyr 


Tyr 


Pro 


Pro Trp 


Leu Asp Asp 


He 


Glu 


He 










170 








175 








180 


Leu 


lie 


Gin 


Pro 


Glu 


He 


Pro 


Lys Tyr 


Ser His 


Gly Asp 


Gly 


He 










185 








190 








195 


Thr 


Ala 


Val 


Glu 


Arg 


He 


Leu 


Leu Lys 


Ala Ala 


Ser 


Gin 


Ser 


Arg 










200 








205 








210 


Met 


Asn 


Lys 


Arg 


He 


He 


Glu 


Glu He 


Cys Arg 


Lys 


Val 


Thr 


Pro 










215 








220 








225 


Pro 


Val 


Pro 


Pro 


Lys 


Lys 


Val 


Thr Ala 


Ala Glu 


Lys 


Lys 


Thr 


Leu 










230 








235 








240 


Asp 


Lys 


Glu 


Glu 


Arg 


Arg 


Gin 


Lys Ala Arg Glu Arg Gin Gin Lys 










245 








250 








255 


Leu 


Leu 


Ala 


Glu 


Phe 


Ala 


Ser 


Arg Gin Lys Ser 


Phe 


Met 


Glu 


Thr 










260 








265 








270 


Ala 


Met 


Asp 


Val 


Asp 


Ser 


Pro 


Glu Asn Asp He 


Pro 


Met 


Glu 


He 










275 








280 








285 


Thr 


Thr 


Ala 


Glu 


Pro 


Gin 


Val 


Ser Glu Ala Val 


Tyr 


Asp 


Cys 


Val 










290 








295 








300 


lie 


Cys 


Gly 


Gin 


Ser 


Gly 


Pro 


Ser Ser 


Glu Asp 


Arg 


Pro 


Thr Gly 










305 








310 








315 


. Leu 


Val 


Val 


Leu 


Leu 


Gin 


Ala 


Ser Ser 


Val Leu Gly Gin Cys Arg 










320 








325 








330 


Asp 


Asn 


Val 


Glu 


Pro 


Lys 


Lys 


Leu Pro 


He Ser 


Glu 


Glu 


Glu 


Gin 










335 








340 








345 


He 


Tyr 


Pro 


Trp 


Asp 


Thr 


Cys 


Ala Ala 


Val His 


Asp Val 


Arg 


Leu 










350 








355 








360 


' Ser 


Leu 


Leu 


Gin 


Arg 


Tyr 


Phe 


Lys Asp 


Ser Ser 


Cys 


Leu 


Leu 


Ala 










365 








370 








375 


Val 


Ser 


He 


Gly 


Trp 


Glu 


Gly 


Gly Val 


Tyr Val 


Gin 


Thr 


Cys 


Gly 










380 








385 








390 


His 


Thr 


Leu 


His 


He 


Asp 


Cys 


His Lys 


Ser Tyr 


Met. Glu 


Ser 


Leu 










395 








400 








405 


Arg 


Asn 


Asp 


Gin 


Val 


Leu 


Gin 


Gly Phe Ser Val Asp Lys 


Gly 


Glu 
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PCT/US02/19360 







410 








415 






/OA 

420 


Phe 


Thr Cys Pro 


Leu 


Cys Arg 


Gin 


Phe 


Ala 


Asn 


Ser Val Leu 


Pro 






425 








430 






435 


Cys 


Tyr Pro Gly Ser 


Asn Val 


Glu 


Asn 


Asn 


Pro Trp Gin Arg 


Pro 






440 








445 






450 


Ser 


Asn Lys Ser 


He 


Gin Asp 


Leu 


He 


Lys 


Glu 


Val Glu Glu 


Leu 






455 








460 






465 


Gin Gly Arg Pro Gly 


Ala Phe 


Pro 


Ser 


Glu 


Thr 


Asn Leu Ser 


Lys 






470 








475 






480 


Gin 


Met Glu Ser 


Val 


Met Lys 


Asp 


He 


Lys Asn 


Thr Thr Gin 


Lys 






485 








490 






495 


Lys 


Tyr Arg Asp 


Tyr 


Ser Lys 


Thr 


Pro 


Gly 


Ser 


Pro Asp Asn 


Asp 






500 








505 






510 


Phe 


Leu Phe Met 


Tyr 


Ser Val 


Ala 


Arg 


Thr 


Asn 


Leu Glu Leu 


Glu 






515 








520 






525 


Leu 


He His Arg 


Gly 


Gly Asn 


Leu Cys 


Ser Gly Gly Ala Ser 


Thr 






530 








535 






540 


Ala Gly Lys Arg 


Ser 


Cys Leu 


Asn 


Gin 


Leu 


Phe 


His Val Leu 


Ala 






545 








550 






555 


Leu 


His Met Arg 


Leu 


Tyr Ser 


He Asp 


Ser Glu Tyr Asn Pro 


Txp 






560 








565 






570 


Arg 


Lys Leu Thr 


Gin 


Leu Glu 


Glu Met Asn Pro Gin Leu Gly 


Tyr 






575 








580 






585 


Glu 


Glu Gin Gin 


Pro 


Glu Val 


Pro 


He Leu Tyr His Asp Val 


Thr 






590 








595 






600 


Ser 


Leu Leu Leu 


He 


Gin He 


Leu 


Met 


Met 


Pro 


Gin Pro Leu 


Arg 






605 








610 






615 


Lys 


Asip His Phe 


Thr 


Cys He 


Val 


Lys 


Val 


Leu 


Phe Thr Leu 


Leu 






620 








625 






630 


Tyr 


Thr Gin Ala 


Leu 


Ala Ala 


Leu 


Ser 


Val 


Lys 


Cys Ser Glu 


Glu 






635 








640 






645 


Asp 


Arg Ser Ala 


Trp 


Lys His 


Ala 


Gly Ala 


Leu Lys Lys Ser 


Thr 






650 








655 






660 


Cys 


Asp Ala Glu 


Lys 


Ser Tyr 


Glu 


Val 


Leu 


Leu 


Ser Phe Val 


He 






665 








670 






675 


Ser 


Glu Leu Phe 


Lys 


Gly Lys 


Leu 


Tyr 


His 


Glu Glu Gly Thr 


Gin 






680 








685 






690 


Glu Cys Ala Met 


Val 


Asn Pro 


He 


Ala 


Trp Ser 


Pro Glu Ser 


Met 






695 








700 






705 


Glu 


Lys Cys Leu 


Gin 


Asp Phe 


Cys 


Leu 


Pro 


Phe Leu Arg He 


Thr 






710 








715 






720 


Ser 


Leu Leu Gin 


His 


His Leu 


Phe Gly Glu Asp 


Leu Pro Ser 


Cys 






725 








730 






735 


Gin 


Glu Glu Glu 


Glu 


Phe Ser 


Val 


Leu 


Ala 


Ser 


Cys Leu Gly 


Leu 






740 








745 






750 


Leu 


Pro Thr Phe 


Tyr 


Gin Thr 


Glu 


His 


Pro 


Phe 


He Ser Ala 


Ser 






755 








760 






765 


Cys 


Leu Asp Trp 


Pro 


Val Pro 


Ala 


Phe 


Asp 


He 


He Thr Gin 


Trp 






770 








775 






780 


Cys 


Phe Glu He 


Lys 


Ser Phe 


Thr 


Glu 


Arg 


His 


Ala Glu Gin 


Gly 






785 








790 






795 


Lys 


Ala Leu Leu 


He 


Gin Glu 


Ser Lys 


Trp 


Lys 


Leu Pro His 


Leu 






800 








805 






810 


Leu 


Gin Leu Pro 


Glu 


Asn Tyr 


Asn 


Thr 


He 


Phe 


Gin Tyr Tyr 


His 






815 








820 






825 


Arg 


Lys Thr Cys 


Ser 


Val Cys 


Thr 


Lys 


Val 


Pro 


Lys Asp Pro 


Ala 






830 








835 






840 


Val 


Cys Leu Val 


Cys 


Gly Thr 


Phe 


Val 


Cys 


Leu 


Lys Gly Leu 


Cys 






845 








850 






855 


Cys 


Lys Gin Gin 


Ser 


Tyr Cys 


Glu 


Cys 


Val 


Leu 


His Ser Gin 


Asn 






860 








865 






870 


Cys 


Gly Ala Gly Thr 


Gly He 


Phe 


Leu 


Leu 


He 


Asn Ala Ser 


Val 






875 








880 






885 
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He 


He 


He lie 


Arg 


vaXy nlS ATg IrJie 












Tyr Leu Asp Ala 


■tlXS 


\3±y \sX\l VaXU i^sp 












Lys 


Pro 


Leu Tyr 


j.j.e 


L-yS J-jyS la±U ATy 








920 




Gin 


Trp 


He Ser 


His 


Thr Phe Asp His 








935 




Pro 


His 


Tyr Asn 


Gly 


Leu 








950 





<210>.18 

<211> 668 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc„feature 

<223> Incyte ID No: 7102342CD1 

<400> 18 



Met 


Phe 


Arg 


Leu 


Trp 


Leu 


Leu 


Leu 


Ala 


X 








tr 
D 










Ala 


Ser 


Arg 


Pro 


Gly 

zU 


Phe 


Gin 


Asn 


Ser 


Pro 


Glu 


Lys 


He 


Gin 


Thr 


Asn 


Thr 


Asn 


Tyr 


Glu 


Gin 


He 


Ser 

DU 


Tyr 


He 


He 


Pro 


Thr 


Val 


His 


Leu 


Lys 

O D 


Gin 


Arg 


Tyr 


Phe 


He 


Tyr 


Leu 


Tyr 


Asn 

O A 


Gin 


Gly 


Ser 


Met 


He 


Gin 


Thr 


Gin 


Cys 


Tyr 


Tyr 


Gin 


Gly 


Asp 


Ser 


Met 


Val 


Thr 

Tin 


Leu 


Ser 


Thr 


Cys 


Leu 


Gin 


Phe 


Glu 


Asn 
125 


Val 


Ser 


Tyr 


Gly 


Ala 


Val 


Glu 


Phe 


Gin 
140 


His 


Val 


Leu 


Tyr 


Asn 


Asp 


He 


Ala 


He 
155 


Phe 


He 


Asp 


Arg 


Met 


Asp 


Asp 


Asn 


He 
170 


Phe 


He 


Ser 


Glu 


Pro 


Asp 


Leu 


Phe 


Pro 
185 


Leu 


Tyr 


Leu 


Glu 


Lys 


Thr 


Leu 


Tyr 


Asp 
200 


Tyr 


Trp 


Gly 


Ser 


Asn 


Lys 


Val 


He 


Glu 
215 


He 


Val 


Gly 


Leu 


Gin 


Phe 


Lys 


Val 


Thr 
230 


He 


Val 


Leu 


Ser 


Asp 


Glu 


Asn 


Lys 


He 
245 


Ser 


Thr 


Val 


Gly 


Gin 


Lys 


Phe 


Leu 


Glu 
260 


Trp 


Lys 


Gin 


Ser 


His 


Asp 


He 


Ala 


Tyr 

275 


Leu 


Leu 


He 


Tyr 


Leu 


Gly 


Ala 


Val 


Phe 
290 


Pro 


Gly 


Thr 


Met 


Ala 


Gly 


Val 


Ala 


Leu 
305 


Gin 


Cyis 


Gly 


Pro 
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Cys 


Leu 


Trp 


Gly Ser 


Val 


o 










Q n A 

y uu 


Arg 


Asp 


Leu 


Arg Arg 


Gly 


910 










915 


Tyr 


Lys 


Val 


Leu 


Glu 


Gin 


925 










930 


He 


Asn 


Lys 


Arg 


Trp 


Gly 


940 










945 


Gly 


Leu 


Cys 


Gly Leu 


Leu 












lb 


Leu 


Leu 


Gin 


He 


Val 


*r1 ^ 

lie 


ZD 










O A 

30 


Asp 


Ser 


Ser 


Glu 


He 


Glu 












>1-C 

4b 


ixe 


Asp 


GXU 


Lys 


Leu 


Tyr 












ou 


Leu 


TV T 

Ala 


Asp 


Asn 


Phe 


Met 


/U 










Id 


Asn 


Thr 


Tyr 


Ser 


Ser 


Asp 


85 










90 


Asn 


lie 


Glu 


Gly Tyr 


Pro 


1 f\f\ 
xuu 










1 AC 

lUb 


Ser 


Gly 


Leu 


Arg Gly 


He 


lie 

lib 










1 A 

120 


He 


Glu 


Pro 


Leu 


Glu 


Ser 


130 










IOC 

135 


Lys 


Leu 


Lys 


Asn 


Glu 


Asp 


145 










150 


Ser 


Leu 


Lys 


Glu 


Gin 


Pro 


160 










165 


Lys 


Ser 


Glu 


Pro 


Ala 


Val 


175 










180 


Met 


His 


He 


Val 


Val 


Asp 


lyo 










195 


Asp 


Ser 


Met 


He 


Val 


Thr 


O AC 










Ol A 

zlU 


TV 1 -a 

Ala 


Asn 


Ser 


Met 


Phe 


Thr 


O O A 

220 










one 

22b 


Ser 


Leu 


Glu 


Leu 


Trp 


Ser 


z ob 










O >J A 

z4U 


Glu 


Ala 


Asp 


Glu 


Leu 


Leu 


OCA 










A C C 

zbb 


Tyr 


Leu 


Asn 


Leu 


Arg 


Pro 


o c c 
z bb 










T7 A 
Z /U 


Met 


Asp 


Tyr 


Pro 


Arg 


Tyr 


280 










285 


Cys 


He 


Thr 


Arg 


Tyr 


Ser- 


295 










300 


Ala 


Ser 


Cys 


Cys 


Asp 


Phe 


310 










315 
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Arg 


Thr Cys Val 


Leu 


Lys 


Asp 


Gly Ala 


Lys 


Cys Tyr Lys Gly 


Leu 








320 








325 




330 


Cys 


Cys 


Lys Asp 


Cys 


Gin 


He 


Leu Gin 


Ser 


Gly Val Glu Cys 


Arg 








335 








340 




345 


Pro 


Lys 


Ala His 


Pro 


Glu 


Cys 


Asp He 


Ala 


Glu Asn Cys Asn 


Gly 








350 








355 




360 


Ser 


Ser 


Pro Glu 


Cys 


Gly 


Pro 


Asp He 


Thr 


Leu He Asn Gly 


Leu 








365 








370 




375 


Ser 


Cys 


Lys Asn 


Asn 


Lys 


Phe 


He Cys 


Tyr 


Asp Gly Asp Cys 


His 








380 








385 




390 


Asp 


Leu Asp Ala 


Arg 


Cys 


Glu 


Ser Val 


Phe 


Gly Lys Gly Ser 


Arg 








395 








400 




405 


Asn 


Ala 


Pro Phe 


Ala 


Cys 


Tyr 


Glu Glu 


He 


Gin Ser Gin Ser 


Asp 








410 








415 




420 


Arg 


Phe 


Gly Asn 


Cys 


Gly 


Arg 


Asp Arg 


Asn 


Asn Lys Tyr Val 


Phe 








425 








430 




435 


Cys 


Gly 


Trp Arg 


Asn 


Leu 


He 


Cys Gly 


Arg 


Leu Val Cys Thr 


Tyr 








440 








445 




450 


Pro 


Thr 


Arg Lys 


Pro 


Phe 


His 


Gin Glu 


Asn 


Gly Asp Val He 


Tyr 








455 








460 




465 


Ala 


Phe 


Val Arg 


Asp 


Ser 


Val 


Cys He 


Thr 


Val Asp Tyr Lys 


Leu 








470 








475 




480 


Pro 


Arg 


Thr Val 


Pro 


Asp 


Pro 


Leu Ala 


Val 


Lys Asn Gly Ser 


Gin 








485 








490 




495 


Cys 


Asp 


He Gly 


Arg 


Val 


Cys 


Val Asn 


Arg 


Glu Cys Val Glu 


Ser 








500 








505 




510 


Arg 


He 


He Lys 


Ala 


Ser 


Ala 


His Val 


Cys 


Ser Gin Gin Cys 


Ser 








515 








520 




525 


Gly 


His 


Gly Val 


Cys 


Asp 


Ser 


Arg Asn 


Lys 


Cys His 'Cys Ser 


Pro 








530 








535 




540 


Gly 


Tyr 


Lys Pro 


Pro 


Asn 


Cys 


Gin He 


Arg 


Ser Lys Gly Phe 


Ser 








545 








550 




555 


He 


Phe 


Pro Glu 


Glu 


Asp 


Met 


Gly Ser 


He 


Met Glu Arg Ala 


Ser 








560 








565 




570 


Gly 


Lys 


Thr Glu 


Asn 


Thr 


Trp 


Leu Leu 


Gly 


Phe Leu He Ala 


Leu 








575 








580 




585 


Pro 


He 


Leu He 


Val 


Thr 


Thr 


Ala He 


Val 


Leu Ala Arg Lys 


Gin 








59.0 








595 




600 


Leu 


Lys 


Lys Trp 


Phe 


Ala 


Lys 


Glu Glu 


Glu 


Phe Pro Ser Ser 


Glu 








605 








610 




615 


Ser 


Lys 


Ser Glu 


Gly 


Ser 


Thr 


Gin Thr 


Tyr 


Ala Ser Gin Ser 


Ser 








620 








625 




630 


Ser 


Glu 


Gly Ser 


Thr 


Gin 


Thr 


Tyr Ala 


Ser 


Gin Thr Arg Ser 


Glu 








635 








640 




645 


Ser 


Ser 


Ser Gin 


Ala 


Asp 


Thr 


Ser Lys 


Ser 


Lys Ser Gin Asp 


Ser 








650 








655 




660 


Thr 


Gin 


Thr Gin 


Ser 


Ser 


Ser 


Asn 









665 

<210> 19 
<211> 206 
.<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4169939CD1 



<400> 19 

Met Met Leu Arg Leu 
1 5 
Ser Gly Tyr Gly Pro 

20 



Leu Ser Ser Leu Leu 
10 

Pro Ser Ser His Ser 
25 



Leu Val Ala Val Ala 
15 

Ser Ser Arg Val Val 
30 
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His 


Gly 


Glu Asp 


Ala 


He 


Pro 


He Asn 








35 








His 


Pro 


Leu Trp 


Asn 


Arg 


Ser 


Cys Val 








50 








Ala 


Leu 


lie Lys 


Leu 


Ser 


Arg 


Ser Ala 








65 








Gin 


Leu 


Ala Ser 


Leu 


Pro 


Pro 


Ala Gly 


Thr 


Pro 


Cys Tyr 


80 
He 


Thr 


Gly 


Trp Gly 








95 








Pro 


Leu 


Pro Asp 


Lys 


Leu 


Gin 


Gin Ala 








110 








Tyr 


Lys 


His Cys 


Ser 


Arg 


Trp 


Asn Trp 








125 








Lys 


Thr 


Met Val 


Cys 


Ala 


Gly 


Gly Tyr 








140 








Gly 


Asp 


Ser Gly 


Gly 


Pro 


Leu 


Asn Cys 








155 








Trp 


Gin 


Val His 


Gly 


Val 


Thr 


Ser Phe 








170 








Asn 


Phe 


lie Trp 


Lys 


Pro 


Thr 


Val Phe 








185 








lie 


Asp 


Trp lie 


Glu 


Glu 


Thr 


He Ala 








200 









<210> 20 

<211> 267 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6539977CD1 

<400> 20 



Met 


Ser 


Leu 


Arg 


Val 


Leu 


Gly 


Ser Gly 


1 








5 








Lys 


Met 


Phe 


Leu 


Leu 
20 


Leu 


Thr 


Ala Leu 


Met 


Thr 


Arg 


Ser 


Gin 

35 


Glu 


Asp 


Glu Asn 


Thr 


Cys 


Thr 


Arg 


Ser 
50 


Ser 


Gin 


Pro Trp 


Gly 


Pro 


Arg 


Arg 


Arg 
65 


Phe 


Leu 


Cys Gly 


Gin 


Trp 


Val 


He 


Thr 
80 


Ala 


Ala 


His Cys 


Val 


Ala 


Leu 


Gly 


Lys 
95 


His 


Asn 


Leu Arg 


Gin 


Val 


Leu 


Arg 


Val 
110 


Val 


Arg 


Gin Val 


Ser 


Arg 


Thr 


His 


Asp 
125 


Asn 


Asp 


Leu Met 


Pro 


Ala 


Arg 


He 


Gly 
140 


Arg 


Ala 


Val Arg 


Ala 


Cys 


Ala 


Ser 


Pro 

155 


Gly 


Thr 


Ser Cys 


Thr 


He 


Ser 


Ser 


Pro 
170 


He 


Ala 


Arg Tyr 


Val 


Asn 


He 


Asn 


He 
185 


Ser 


Pro 


Asp Glu 


Pro 


Arg 


Thr 


He 


Thr 
200 


Pro 


Gly 


Met Val 
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Ser 


Glu 


Glu Leu Phe 


Val 


40 






45 


Ala 


Cys Gly Asn Asp 


He 


55 






60 


Gin 


Leu Gly Asp Ala 


Val 


70 






75 


Asp 


He 


Leu Pro Asn 


Lys 


85 






90 


Arg 


Leu Tyr Thr Asn 


Gly 


100 






105 


Arg 


Leu 


Pro Val Val 


Asp 


115 






120 


Trp 


Gly Ser Thr Val 


Lys 


130 






135 


He 


Arg Ser Gly Cys 


Asn 


145 






150 


Pro 


Thr Glu Asp Gly 


Gly 


160 






165 


Val 


Ser Gly Phe Gly 


Cys 


175 






180 


Thr 


Arg Val Ser Ala 


Phe 


190 






195 


Ser 


His- 






205 








Thr 


Trp 


Pro Ser Ala 


Pro 


10 




15 


Gin 


Val 


Leu Ala He 


Ala 


25 






30 


Lys 


He 


He Gly Gly 


Tyr 


40 






- 45 


Gin 


Ala 


Ala Leu Leu 


Ala 


55 






60 


Gly 


Ala 


Leu Leu Ser 


Gly 


70 






75 


Gly 


Arg 


Pro He Leu 


Gin 


85 






90 


Arg 


Trp Glu Ala Thr 


Gin 


100 






105 


Thr 


His 


Pro Asn Tyr 


Asn 


115 






120 


Leu 


Leu 


Gin Leu Gin 


Gin 


130 






135 


Pro 


He 


Glu Val Thr 


Gin 


145 






150 


Arg 


Val 


Ser Gly Trp 


Gly 


160 






165 


Pro 


Ala 


Ser Leu Gin 


Cys 


175 






180 


Val 


Cys 


Gin Lys Ala 


Tyr 


190 






195 


Cys 


Ala Gly Val Pro 


Gin 


205 






210- 
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Gly Gly Lys Asp Ser Cys Gin Gly Asp 
215 

Cys Arg Gly Gin Leu Gin Gly Leu Val 

230 

Cys Ala Leu Pro Gly Tyr Pro Gly Val 
245 

Tyr Arg Ser Trp lie Glu Glu Thr Met 
260 

<210> 21 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7675588CD1 

<400> 21 



Met 


Gly Leu Ser 


Gly 


Leu 


Leu 


Pro 


He 


1 






5 










Leu Gly Asp 


He 


Gin 


Glu 


Pro 


Gly His 








20 










Lys 


Pro Cys 


Pro 


Lys 
35 


lie 


Lys 


Val 


Glu 


Asp 


Gin Cys 


Thr 


Lys 
50 


Pro 


Arg 


Asp 


Cys 


Cys 


Pro Phe 


Ser 


Arg 
65 


Gly 


Lys 


Lys 


Cys 


Ser 


Leu Thr 


Leu 


Tyr 
80 


His 


Lys 


Glu 


Glu 



<210> 22 

<211> 232 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6244077CD1 

<400> 22 



Met 


Ala 


Asn 


Tyr 


Tyr 


Glu 


Val 


Leu 


Gly 


1 








5 










Pro 


Glu 


Asp 


He 


Lys 
20 


Lys 


Ala 


Tyr 


Arg 


His 


Pro 


Asp 


Lys 


Asn 

35 


Pro 


Asp 


Asn 


Lys 


Phe 


Lys 


Leu 


Val 


Ser 
50 


Glu 


Ala 


Tyr 


Glu 


Lys 


Arg 


Ser 


Leu 


Tyr 
65 


Asp 


Arg 


Ala 


Gly 


Gly 


Gly 


Gly 


Ala 


Ser 
80 


Thr 


Pro 


Tyr 


His 


Tyr 


Thr 


Phe 


Arg 


Asn 
95 


Pro 


Glu 


Asp 


He 


Gly 


Leu 


Asp 


Pro 


Phe 
110 


Ser 


Phe 


Glu 


Phe 


Ser 


Asp 


Arg 


Gly 


Gly 
125 


Arg 


Gly 


His 


Gly 


Ala 


Gly 


Phe 


Gly 


Glu 
140 


Phe 


Pro 


Ala 


Phe 


Phe 


Asn 


Met 


Leu 


Gly 


Cys 


Ser 


Gly Gly 
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Ser 


Gly Gly Pro Leu 


Val 


220 




225 


Ser 


Trp Gly Met Glu 


Arg 


235 




240 


Tyr 


Thr Asn Leu Cys 


Lys 


250 




255 


Arg 


Asp Lys 




265 






Leu 


Val Pro Phe He 


Leu 


10 




15 


Ala 


Glu Gly He Leu 


Gly 


25 




30 


Cys 


Glu Val Glu Glu 


He 


40 




45 


Pro 


Glu Asn Met Lys 


Cys 


55 




60 


Leu 


Asp Phe Arg Lys 


Val 


70 




75 


Leu 


Glu 




85 






Val 


Gin Ala Ser Ala 


Ser 


10 




15 


Lys 


Leu Ala Leu Arg 


Trp 


25 




30 


Glu 


Glu Ala Glu Lys 


Lys 


40 




45 


Val 


Leu Ser Asp Ser 


Lys 


55 




60 


Cys 


Asp Ser Trp Arg 


Ala 


70 




75 


Ser 


Pro Phe Asp Thr 


Gly 


85 




90 


•Phe 


Arg Glu Phe Phe 


Gly 


100 




105 


Trp 


Asp Ser Pro Phe 


Asn 


115 




120 


Leu 


Arg Gly Ala Phe 


Ser 


130 




135 


Met 


Glu Ala Phe Ser 


Ser 


145 




150 


Ser 


His Thr Thr Phe 


Ser 
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160 




165 




Ser Pne 




Gly 


Ser Ser Ser Gly Ser Ser Gly 


Phe 


Lys 






1 / u 




175 




180 


Ser Val 


Met Ser 


Sear 


Thr 


Glu Met lie Asn Gly His Lys 


Val 


Thr 






1 Q C 
J.OD 




190 




195 


Thr Lys 


Arg lie 


VaX 


Glu 


Asn Gly Gin Glu Arg Val Glu Val 


Glu 






200 




205 




210 


Glu Asp Gly Gin 


Leu 


Lys 


Ser Val Thr Val Asn Gly Lys 


Glu 


Gin 






215 




2.20 




225 


Leu Lys 


Trp Met 


Asp 


Ser 


Lys 










230 











<210> 23 














<211> 237 














<212> PRT 














<213> Homo J 


sapiens 










<220> 
















<221> misc_: 


feature 










<223> Incyte ID 


No: 


7498404CD1 




<400> 23 














"Mot- 2^1 a 


Ser 


Leu 


Glu 


Val 


Ser 


Arg Ser Pro Arg Arg 


Ser Arg Arg 


_L 






5 






10 


15 


r^T 1 1 T at 1 


oXU 


Val Arg 


Ser 


Pro 


Arg Gin Asn Lys Tyr 


Ser Val Leu 








20 






25 


30 


LicU irro 


inr 


Tyr Asn 


Glu 


Arg 


Glu Asn Leu Pro Leu 


He Val Trp 








35 






40 


45 


Leu Leu 


Val 


Lys 


Ser 


Phe 


Ser 


Glu Ser Gly He Asn 


Tyr Glu He 








50 






55 


60 


He He 


He 


Asp Asp 


Gly 


Ser 


Pro Asp Gly Thr Arg Asp Val Ala 








65 






70 


75 


Glu Gin 


•Leu 


Glu Lys 


He 


Tyr 


Gly Ser Asp Arg He Leu Leu Arg 








80 






85 


90 


Pro Arg 


Glu 


Lys 


Lys 


Leu 


Gly 


Leu Gly Thr Ala Tyr 


He His Gly 








95 






100 


105 


Met Lys 


His 


Ala 


Thr 


Gly 


Asn 


Tyr He He He Met 


Asp Ala Asp 








110 






115 


120 


Leu Ser 


His 


His 


Pro 


Lys 


Phe 


He Pro Glu Phe He 


Arg Lys Gin 








125 






130 


135 


Lys Glu 


Gly 


Asn 


Phe 


Asp 


He 


Val Ser Gly Thr Arg- Tyr Lys Gly 








140 






145 


150 


Asn Gly 


Gly 


Val 


Tyr 


Gly 


Trp 


Asp Leu Lys Arg Lys 


He He Arg 








155 






150 


165 


Leu Tyr 


Arg 


Lys 


Glu 


Val 


Leu 


Glu Lys Leu He Glu 


Lys Cys Val 








170 






175 


180 


Ser Lys 


Gly 


Tyr Val 


Phe 


Gin 


Met Glu Met He Val 


Arg Ala Arg 








185 






190 


195 


Gin Leu 


Asn 


Tyr Thr 


He 


Gly 


Glu Val Pro He Ser Phe Val Asp 








200 






205 


210 


Arg Val 


Tyr 


Gly Glu 


Ser 


Lys 


Leu Gly Gly Asn Glu 


He Val Ser 








215 






220 


225 


Phe Leu 


Lys 


Gly Leu 


Leu 


Thr 


Leu Phe Ala Thr Thr 










230 






235 





<210> 24 

<211> 145 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7391748CD1 
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<400> 24 

Met Leu Leu Gin Leu Ser Arg Arg Val Arg Arg Asn Arg Asn Val 
15 10 15 

Asn Pro Val Ala Leu Pro Arg Ala Gin Glu Gly Leu Arg Pro Gly 

20 25 30 

Thr Leu Cys Thr Val Ala Gly Trp Gly Arg Val Ser Met Arg Arg 

35 40 45 

Gly Thr Asp Thr Leu Arg Glu Val Gin Leu Arg Val Gin Arg Asp 

50 55 60 

Arg Gin Cys Leu Arg lie Phe Gly Ser Tyr Asp Pro Arg Arg Gin 

65 70 75 

lie Cys Val Gly Asp Arg Arg Glu Arg Lys Ala Ala Phe Lys Gly 

80 85 90 

Asp Ser Gly Gly Pro Leu Leu Cys Asn Asn Val Ala His Gly lie 

95 100 105 

Val Ser Tyr Gly Lys Ser Ser Gly Val Pro Pro Glu Val Phe Thr 
110 115 120 

Arg Val Ser Ser Phe Leu Pro Trp lie Arg Thr Thr Met Arg Ser 
125 130 135 

Phe Lys Leu Leu Asp Gin Met Glu Thr Pro Leu 
140 145 

<210> 25 
<211> 696 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7499780CD1 

<400> 25 

Met Thr Ser Ser Gly Pro Gly Pro Arg Phe Leu Leu Leu Leu Pro 
1 5 10 - .15 

Leu Leu Leu Pro Pro Ala Ala Ser Ala Ser Asp Arg Pro Arg Gly - 

20 25 30 

Arg Asp Pro Val Asn Pro Glu Lys Leu Leu Val He Thr Val- Ala , 

35 40 45 

Thr Ala Glu Thr Glu Gly Tyr Leu Arg Phe Leu Arg Ser Ala Glu 
50 55 60 

Phe Phe Asn Tyr Thr Val Arg Thr Leu Gly Leu Gly Glu Glu Trp 
65 70 75 

Arg Gly Gly Asp Val Ala Arg Thr Val Gly Gly Gly Gin Lys Val 
80 85 90 

Arg Trp Leu Lys Lys Glu Met Glu Lys Tyr Ala Asp Arg Glu Asp 
95 100 105 

Met He He Met Phe Val Asp Ser Tyr Asp Val He Leu Ala Gly 
110 115 120 

Ser Pro Thr Glu Leu Leu Lys Lys Phe Val Gin Ser Gly Ser Arg 
125 130 135 

Leu Leu Phe Ser Ala Glu Ser Phe Cys Trp Pro Glu Trp Gly Leu 
140 145 150 

Ala Glu Gin Tyr Pro Glu Val Gly Thr Gly Lys Arg Phe Leu Asn 
155 160 165 

Ser Gly Gly Phe He Gly Phe Ala Thr Thr He His Gin He Val 
170 175 180 

Arg Gin Trp Lys Tyr Lys Asp Asp Asp Asp Asp Gin Leu Phe Tyr 
185 190 195 

Thr Arg Leu Tyr Leu Asp Pro Gly Leu Arg Glu Lys Leu Ser Leu 
200 205 210 

Asn Leu Asp His Lys Ser Arg He Phe Gin Asn Leu Asn- Gly Ala 
215 220 225 

Leu Asp Glu Val Val Leu Lys Phe Asp Arg Asn Arg Val Arg He 
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230 




Arg 


Asn 


Val 


Ala 


Tyr Asp 
245 


Thr 


Gly 


Pro 


Thr 


Lys 


Leu Gin 
260 


Leu 


Asn 


Gly 


Trp 


Thr 


Pro Glu 
275 


Gly 


Arg 


Arg 


Thr 


Leu 


Pro Gly 
290 


Gly 


He 


Ala 


Asp 


Ser 


Trp Pro 
305, 


Gin 


Lys 


Leu 


Val 


Gly 


Pro Glu 
320 


Glu 


Asp 


Met 


Ala 


Met 


Asp Leu 
335 


Cys 


Tyr 


Phe 


Ser 


Leu 


Asp Ala 
350 


Asp 


Leu 


Arg 


He 


Leu 


He Glu 
365 , 


Glu 


Leu 


Ser 


Arg 


His 


Gly 'Lys 
380 . 


Leu 


Ser 


Pro 


Asp 


Glu 


Tyr Tyr 
395 


Ala 


Val 


Gin 


Arg 


Lys 


Arg Val 
410 


Gly 


Gin 


Ala 


Tyr 


Val 


He Arg 
425 


Gly 


Gin 


Arg 


Asp 


Val 


Phe Ser 
440 


Gly 


Phe 


Cys 


Lys 


Ser 


Phe Arg 
455 


Asp 


Asn 


Gin 


His 


Glu 


Phe Gly 
470 


Arg 


Thr 


Glu 


His 


Leu 


His Pro 
485 


Asp 


Val 


Asp 


Trp 


Lys 


Glu Gin 
500 


Tyr 


Leu 


Glu 


Gly 


Glu 


Gly He 
515 


Val 


Trp 


Phe 


Pro 


Leu 


Leu Ser 
530 


Glu 


Glu 


Met 


Glu 


His 


Tyr Gly 
545 


Gin 


Ser 


Arg 


Leu 


Ala 


Gly Gly 
560 


Tyr 


His 


Met 


Lys 


Gin 


Val Gly 
575 


Tyr 


Arg 


Thr 


Tyr 


Val 


Gly Pro 
590 


Met 


His 


Thr 


Lys 


Ala 


Arg Ala 
605 


Val 


Pro 


Asp 


Glu 


Gin 


Pro Ser 
620 


Leu 


Phe 


Thr 


Leu 


Asn 


Val Ala 

635 


Leu 


Gly 


Gly 


Gly 


Cys 


Arg Phe 

650 


Leu 


Pro 


Arg 


Lys 


Gly 


Trp Ala 
665 


Leu 


Tyr 


His 


Glu 


Gly 


Leu Pro 
680 


Thr 


Val 


Ser 


Phe 


Val 


Asp Pro 
695 









235 




240 


Leu 


Pro 


He 


Val 


Val His Gly Asn 






250 




255 


Asn 


Tyr 


Leu 


Gly 


Asn Tyr Val Pro 






265 




270 


Gly 


Cys 


Gly 


Phe 


Cys Asn Gin Asp 






280 




285 


Gin 


Glu 


Val 


Phe 


His Glu Pro His 






295 




300 


Leu 


Gin 


Asp 


His 


Phe Ser Ala Val 






310 




315 


Ala 


Leu 


Ser 


Pro 


Gly Glu Ala Arg 






325 




330 


Arg 


Gin 


Asp 


Pro 


Glu Cys Glu Phe 






340 




345 


Ala 


Val 


Leu 


Thr 


Asn Leu Gin Thr 






355 




360 


Asn 


Arg 


Lys 


Val 


He Ala Pro Met 






370 




375 


Trp 


Ser 


Asn 


Phe 


Trp Gly Ala Leu 






385 




390 


Arg 


Ser 


Glu 


Asp 


Tyr Val Glu Leu 






400 




405 


Val 


Trp 


Asn 


Val 


Pro Tyr He Ser 






415 




420 


Asp 


Thr 


Leu 


Arg 


Met Glu Leu Pro 






430 




435 


Ser 


Asp 


Thr 


Asp 


Pro Asp Met Ala 






445 




450 


Lys 


Gly 


He 


Phe 


Leu His Leu Ser 






460 




465 


Leu 


Leu 


Ala 


Thr 


Ser Arg Tyr Asp 






475 




480 


Leu 


Trp 


Gin 


He 


Phe Asp Asn Pro 






490 




495 


He 


His 


Glu 


Asn 


Tyr Ser Arg Ala 






505 




510 


Glu 


Gin 


Pro 


Cys 


Pro Asp Val Tyr 






520 




525 


Gin 


Met 


Cys 


Asp 


Glu Leu Val Ala 






535 




540 


Trp 


Ser 


Gly 


Gly 


Arg His Glu Asp 






550 




555 


Glu 


Asn 


Val 


Pro 


Thr Val Asp He 






565 




570 


Glu 


Asp 


Gin 


Trp 


Leu Gin Leu Leu 






580 




585 


Thr 


Glu 


Ser 


Leu 


Phe Pro Gly Tyr 






595 




600 


Met 


Asn' 


Phe 


Val 


Val Arg Tyr Arg 






610 




615 


Arg 


Pro 


His 


His 


Asp Ser Ser Thr 






625 




630 


Asn 


His 


Lys 


Gly 


Leu Asp Tyr Glu 






640 




645 


Arg 


Tyr 


Asp 


Cys 


Val He Ser Ser 






655 




660 


Leu 


His 


Pro 


Gly 


Arg Leu Thr His 






670 




675 


Thr 


Trp 


Gly 


Thr 


Arg Tyr He Met 






685 




690 
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<210> 26 

<211> 630 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> niisc_feature 

<223> Xncyte ID No: 7499881CD1 

<400> 26 

Met Ala Tyr Gin lie Val Leu Glu Leu His Phe Ser His Cys Ala 
15 10 15 

Ala Met Gly Ala Ala Leu Leu Met Leu lie Glu Asn Ala Leu He 
20 25 30 

Thr Gin Ser Arg Leu Met Leu Leu Glu Ser Val Leu He Phe Phe 
35. 40 45 

Asn Leu Leu Ala Val Leu Ser Tyr Leu Lys Phe Phe Asn Cys Gin 
50 55 60 

Lys His Ser Pro Phe Ser Leu Ser Trp Trp Phe Trp Leu Thr Leu 
55 70 75 

Thr Gly Val Ala Cys Ser Cys Ala Val Gly He Lys Tyr Met Gly 
80 85 90 

Val Phe Thr Tyr Val Leu Val Leu Gly Val Ala Ala Val His Ala 
95 100 105 

Trp His Leu Leu Gly Asp Gin Thr Leu Ser Asn Val Gly Ala Asp 

110 115 120 

Val Gin Cys Cys Met Arg Pro Ala Cys Met Gly Gin Met Arg Met 

125 130 135 

Ser Gin Gly Val Cys Val Phe Cys His Leu Leu Ala Arg Ala Val 

140 145 150 

Ala Leu Leu Val He Pro Val Val Leu Tyr Leu Leu Phe Phe Tyr 

155 160 * 165 

Val His Leu He Leu Val Phe Arg Ser Gly Pro His Asp Gin He 

170 175 180. 

Met Ser Ser Ala Phe Gin Ala Ser Leu Glu Gly Gly Leu Ala Arg 

185 190 195 

He Thr Gin Gly Gin Pro Leu Glu Val Ala Phe Gly Ser Gin Val 

200 205 210 

Thr Leu Arg Asn Val Phe Gly Lys Pro Val Pro. Cys Trp Leu His 

215 ' 220 225 

Ser His Gin Asp Thr Tyr Pro Met He Tyr Glu Asn- Gly Arg Gly 

230 235 240 

Ser Ser His Gin Gin Gin Val Thr Cys Tyr Pro Phe Lys Asp Val 

245 250 - 255 

Asn Asn Trp Trp He Val Lys Asp Pro Arg Arg His Gin Leu Val 

260 265 270 

Val Ser Ser Pro Pro Arg Pro Val Arg His Gly Asp Met Val Gin 

275 280 285 

Leu Val His Gly Met Thr Thr Arg Ser Leu Asn Thr His Asp Val 

290 295 300 

Ala Ala Pro Leu Ser Pro His Ser Gin Glu Val Ser Cys Tyr He 

305 310 315 

Asp Tyr Asn He Ser Met Pro Ala Gin Asn Leu Trp Arg Leu Glu 

320 325 330 

He Val Asn Arg Gly Ser Asp Thr Asp Val Trp Lys Thr He Leu 

335 340 345 

Ser Glu Val Arg Phe Val His Val Asn Thr Ser Ala Val Leu Lys 

350 355 360 

Leu Ser Gly Ala His Leu Pro Asp Trp Gly Tyr Arg Gin Leu Glu 

365 370 375 

He Val Gly Glu Lys Leu Ser Arg Gly Tyr His Gly Ser Thr Val 

380 385 390 

Trp Asn Val Glu Glu His Arg Tyr Gly Ala Ser Gin Glu Gin Arg 
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"3 Q ^ 


A r\r\ 






405 




Arg 




Arg Glu 


Leu His Ser Pro Ala Gin 


Vai 


Asp Vai 


Ser 










415 






420 


Airg 


Asn 


Leu 


Ser Phe 


Met Ala Arg Phe Ser Glu 


Leu 


Gin Trp Arg 










430 






435 




Leu 


TV T ?s 

AJ.a 


Leu Arg 


Ser Asp Asp Ser Glu His 


Lys 


Tyr Ser 


Ser 










445 






450 


Ser 


Pro 


Leu 


Glu Trp 


Vai Thr Leu Asp Thr Asn lie 


Ala Tyr 


Trp 










460 






465 




JtllS 


Pro 


Arg Thr 


Ser Ala Gin lie His Leu 


Leu 


Gly Asn 


lie 








4 / u 


475 






480 


vai 


lie 


Trp 


vai ser 


Gly Ser Leu Ala Leu Ala 


lie 


Tyr Ala Leu 








/I Q R 


490 






495 


Leu 


Ser 


Leu 


Trp Tyr 


Leu Leu Arg Arg Arg Arg Asn 


Vai His 


Asp 








c n r\ 


505 






510 


Leu 


Pro 


bin 


Asp Ala 


Trp Leu Arg Trp Vai Leu 


Ala 


Gly Ala 


Leu 








blo 


520 






525 


Cys 


AX a 


tjiy 


Gly Trp 


Ala Vai Asn Tyr Leu Pro 


Phe 


Phe Leu 


Met 










535 






540 


ijXU 


T t rr^ 

jjys 


ixir 


Leu Phe 


Leu Tyr His Tyr Leu Pro 


Ala 


Leu Thr 


Phe 










550 






555 




j.j>e 


Lqu 


Leu Leu 


Pro Vai Vai Leu Gin His 


ile 


Ser Asp 


His 










565 






570 


Leu 


Cys 


Arg 


Ser Gin 


Leu Gin Arg Ser lie Phe 


Ser 


Ala Leu 


Vai 










580 






585 


VCLl 




Trp 


Tyr Ser 


Ser Ala Cys His Vai Ser 


Asn 


Thr Leu Arg 








590 


595 






600 


Pro 


Leu 


Thr 


Tyr Gly 


Asp Lys Ser Leu Ser Pro 


His 


Glu Leu 


Lys 








605 


610 






615 


Ala 


Leu 


Arg 


Trp Lys 


Asp Ser Trp Asp lie Leu 


lie 


Arg Lys 


His 








620 


625 






630 



<210> 27 
<211> 242 
<212> PRT 

<213> Homo sapiens . 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7488579Cpi 

<400> 27 



Met 


Leu 


Leu 


Leu 


Ala 


Pro 


Gin Met 


Leu Asn Leu Leu 


Leu Leu Ala 


1 








5 






10 


15 


Leu 


Pro 


Vai 


Leu 


Ala 


Ser 


Arg Ala 


Tyr Ala Ala Pro 


Ala Pro Gly 










20 






. 25 


30 


Gin 


Ala 


Leu 


Gin 


Arg 


Vai 


Gly lie 


Vai Gly Gly, Gin 


Glu Ala Pro 










35 






40 


45 


Arg 


Ser 


Lys 


Trp 


Pro 


Trp 


Gin Vai 


Ser Leu Arg Vai 


His Gly Pro 










50 






55 


60 


Tyr 


Trp 


Met 


His 


Phe 


Cys 


Gly Gly Ser Leu lie His 


Pro Gin Trp 










65 






70 


75 


Vai 


Leu 


Thr 


Ala 


Ala 


His 


Cys Vai 


Gly Pro Asp Vai 


Lys Asp Leu 










80 






85 


90 


Ala 


Ala 


Leu 


Arg 


Vai 


Gin 


Leu Arg 


Glu Gin His Leu 


Tyr Tyr Gin 










95 






100 


105 


Asp 


Gin 


Leu 


Leu 


Pro 


Vai 


Ser Arg 


lie lie Vai His 


Pro Gin Phe 










110 






115 


120 


Tyr 


lie 


lie 


Gin 


Thr 


Gly 


Ala Asp 


lie Ala Leu Leu 


Glu Leu Glu 










125 






130 


135 


Glu 


Pro 


Vai 


Asn 


lie 


Ser 


Ser His 


lie His Thr Vai 


Thr Leu Pro 










140 






145 


150 
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Pro Ala Ser Glu Thr Phe Pro Pro Gly Met Pro Cys Trp Val Thr 

155 160 165 

Gly Trp Gly Asp Val Asp Asn Asn Val His Leu Pro Pro Pro Tyr 

170 175 180 

Pro Leu Lys Glu Val Glu Val Pro Val Val Glu Asn His Leu Cys 

185 190 195 

Asn Ala Glu Tyr His Thr Gly Leu His Thr Gly His Ser Phe Gin 

200 205 210 

lie Val Arg Asp Asp Met Leu Cys Ala Gly Ser Glu Asn His Asp 

215 220 225 

Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Lys Val Asn 

230 235 240 

Gly Thr 



<210> 28 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7510521CD1 

.<400> 28 

Met Gly Ala Gly Pro Ser Leu Leu Leu Ala Ala Leu Leu Leu Leii 

15 10 15 
Leu Ser Gly Asp Gly Ala Val Arg Cys Asp Thr Pro Ala Asn Cys 

20 25 30 

Thr Tyr Leu Asp Leu Leu Gly Thr Trp Val Phe Gin Asp His Lys 

35 40 ' 45 

Lys Lys Lys 



<210> 29 

<211> 4384 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No; 7994355CB1 

<400> 29 

agaggcggcg gcgggtgtct gtaggtggtc 

ggacgtggcc ccaggcagcc ctcgcagctc 

cgcctgctgg acgggcccag gcgagatgta 

cggctgatcg cggaggatcg cggagggcgc 

caaccgcctg acgcgctcgc tgcgccgcgc 

gtctcactat gttccccagg ctggtcttga 

tccccccaaa gtgttgggat tacagatgtg 

cttgaaaaaa tgataatgaa actgctgttt 

aacacaggtc tgtttctgaa ctactatcaa 

gacgtgtgaa aagaagcttg cttcacattg 

ttttgctgtt aaagaaagga gcaaatccta 

ttcatttggc agcaagaaat ggtcatggtc 

gaggagccaa atatctgcca gataaaaatg 

gtggatatgg agagacttgt gaagtattaa 

ttattcaaat gacacagaat gaagacctcc 
atttgtctca gcaaagtgaa agccagtacc 
ctacaacaaa tggtcataaa ctgcttagcc 

gccttttaag gattgtgaga atgttttgtc 

gtaatggaat tgatatgggc tacaatggga 



ggtccggcag cagcccggcc cccggacgca 60 
agtgctctag ccggggcaag cccgcgtctc 120 
gggctctggg cgcggaggcc gccggtgggg 180 
gccgaggatg gagagagcga tggagcaact 240 
gcgcaccgtg gagttgcccg aggagatgag 300 
actcctgggc tcaagtgatc cgcaccctgc 360 
agccaccgcg cctggcctgg ggataatctt 420 
atacattaat gccaatggtt atggctgatc 480 
attcaaaatt tgatgtcaat tatgcattcg 540 
cagcaaattg tggatcggtg gaatgcttgg 600 
actatcaaga tatttcaggc tgtacacccc 660 
agagagatac agcacagatc ctactattac 720 
gagtaactcc tctggattta tgtgtacagg 780 
ttcaatatca cccgaggctt tttcagacta 840 
gagaaaacat gttacggcaa gttctggagc 900 
taaagattct aacaagcctt gctgaagttg 960 
tctctagcaa ttatgatgct caaatgaaga 1020 
acgtctttcg aattggtcca tcctccccca 1080 
ataaaactcc aagaagccag gtgttcaagc 1140 
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ctctggaatt gctttggcac tcgttagatg 
tgaaaaacaa aagagactca acagagatca 
atcaagatgc tgcttccatt cctccatttg 
tgtccactgg cacaagggaa tctaaaccag 
cagattgtca ggatgttatt tctatgacag 
tttacatgtg ctgttcttgt cagatgcctc 
ttgtctgcaa acatgatgaa gttttaaaat 
ttgaccactt tcactttctc cttgaatgtc 
taaaagcaca gccttttaaa gatcgctgtg 
agccagattc agatatggtg cacaggccag 
gagattctat ttttaggagt agctgtgaag 
agcaagggat tgctgtacgg ttccatggag 
agtggtttga tattctgtcc aatgagatag 
cagctgatgg aacaactttt cagcctaata 
actattttcg gtttgctggg cagatcttgg 
atatttactt cacacgatcc ttctacaagc 
atgtggcatc cattgatcca gaatatgcga 
taagtgatct gggtctagaa ctaacttttt 
aagaggtgcc tttgaaacct' gggggtggga 
agtacgtcca gcttgttact gaacttcgaa 
cttttttaca gggctttcat atgttcattc 
atgaattgga gctactgctt tctggcatgc 
atacagaata cacaagtggc tatgaaagag 
ttgtagaaga cattactcaa gaggagagag 
ccagggtccc acatggtggg tttgctaata 
caatcgctgc tgtgccatat actccaaatc 
tgctcaagtt acctgaatac ccaagtaaag 
tacattgtgg cagctatggt tacacaatgg 
actgatgcac aattcagaat ggcagaagta 
taaatgcaac ccataggcag ggctgatgct 
gtttctctct tttccctttt atgttttctc 
cacagtagat tttatttttt aaaatgctaa 
aattaatttt atccaagatt gtattaaggc 
tgtaactgtc ttgttaaaag ggtgttttct 
ccagctggtg tgttcttttg catgtaaact 
gaatggattt agattgtcta taaagccaca 
ttgcaatgat ttttt'tagca ttttaatatt 
atactaacag ttgatgaaat gacatataat 
gaattatttg catctttcag tggccagttt 
tttcttacgc ctcattttaa tttataatga 
tcttttcatt acttttatct gtgagctctg 
attataattt aaatatgtta atttttttgc 
tcaagttcaa gtgatatgtg cctaatatca 
ggtggttaat cagttttatt ttgaaaagac 
caggaagtag gaacatgcat acacgaagaa 
atcttccctt ggaaactaaa aatagaatat 
tgcagatttt ctcataagca gtcattgtgt 
tgaaagtttt gctaaattat aaagctcctt 
ttgtcttttc taaatgttaa caaaatatat 
tttaaaagtt accattagct ctaaaattaa 
gctaataatg ttttatgttt tgattttttg 
agttttcaaa tagataacca cctttaatat 
tatgtttctg tttataacca tataatacat 
tagttatgta aaagagatag ataataaaat 
aaaa 



aatggctagt tttaatagcc acagaattga 1200 
cttctatttt actgaaacaa aaaggccaag 1260 
aacctccagg acctgggagc tatgaaaatc 1320 
atgctcttgc agggagacag gaagccagtg 1380 
ctaaccggct aagtgctgtc attcaagctt 1440 
cgggaatgac ttcacctcgt ttcattgaat 1500 
gctttgttaa tagaaatccc aaaattatat 15 60 
ctgagttgat gtcaagattc atgcatatca 1620 
aatggttcta tgaacatttg cattcaggac 1680 
tgaatgaaaa tgatatcctg ctggttcaca 1740 
ttgtgtcaaa agcaaattgt gcaaagctaa 1800 
aagaaggcat gggtcaaggt gttgtgcgtg 1860 
tcaatcctga ttatgcattg tttacccagt 1920 
gcaactctta tgtaaatcct gatcacttga 1980 
gattagcgtt gaaccacagg cagctggtca 2040 
acattcttgg tattcctgta aattaccaag 2100 
aaaatttgca atggatttta gataatgata 2160 
ctgttgagac tgatgtgttt ggagcaatgg 2220 
gtattcttgt gacacaaaat aataaagcgg 2280 
tgacaagagc cattcagcct cagatcaatg 2340 
caccctccct catacagctt tttgatgaat 2400 
cagaaattga tgtgagtgat tggataaaaa 2460 
aagatccagt tattcagtgg ttctgggaag 2520 
ttcttctctt acagtttgtt acgggcagtt 2580 
tcatgggtgg aagtggattg caaaacttta 2640 
ttttaccaac ttcaagcaca tgcatcaaca 2700 
aaatactcaa ggacagactt cttgtggcac 2760 
cataatgaag tctggaaaac tcctctgact 2820 
atttgggaaa atgtcaacaa aaaagcagcc 2880 
tccaatttat aaaggatcat caggttttct 2940 
tgtttgtgat acaattagaa aatataaaat 3000 
ctgaaagtaa tagagactgt cctttttcat 3060 
aaaatctgat tctacattcc acctctgcta 3120 
cctaatttct gatatattat atgaggtcat 3180 
gccatttata ttttagaaaa ctattgtata 3240 
aatacgtatt ttgccacagt gtattctata 3300 
ttaatatata ttgtaaaatt tagactgatg 33 60 
ttatatatga aagcttacgc tatattgtat 342 0 
tccatatgta tatattatgg tctcaatgtt 3480 
aggtaaaatt aaaatgtatt ttaccacgtt 3540 
acacatctga aaaagtaatc tgatgtgcaa 3 600 
ttcttaaatt tgcttttcat cattaaaatg 3660 
cttggatgtt ggtgggtttt tgaatttttg 3720 
gtacttgaat agttacagca tatgtttgaa 3 7 80 
atgctaacgg aaggatttgt tatgtttagg 3 840 
taatgacatt actgtttgta gaatgacata 3900 
ttgccagtaa tgtttgagag acatgtaagt 3960 
taattcgttg gttttgattc tcttattctc 4020 
cttaacagat tacatgaaat ttaggaatta 4080 
gattcggatg ctttatttat agtaactgaa 4140 
aaatttaatt gtagaagtca ctgccttctg 4200 
tacactgctt ataatactaa tgtttacaga 4260 
tggctttgtc atattagttt tttttgcaag 4320 
attaaataac tgagaaaaaa aaaaaaaaaa 4380 

4384 



<210> 30 
<211> 4007 

<212> DNA 

<213> Homo " sapiens 

<220> 

<221> misc^feature 
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<223> Incyte ID No: 7475875CB1 

<400> 30 

atggcatgtt ccatggcgtg tggcggtaga 
agtgagcagg agcaagccat taagggggtt 
gccatggccc gcccatcctc tgagctcctg 
agccatggca taaaaacaat aatcaacctc 
aaccctctgg aacaagaaag tggcttcac.a 
atttacttct acaatttcgg atggaaggat 
gatatggtga aggtgatgac atttgcctta 
gcagggcttg gtcgaacagg tgttttaata 
actgctgacc aagcaattat atttgtgcgg 
ggacagctcc tctgtgtaag ggaatttact 
tcttgctgtg atcccaaagc acatgctgtc 
catctgcttc atggttatga ggcacgactt 
gtttgcaaat tgctgctgga cttagcggag 
gaaggacctg gtctctctgc tgaaatagaa 
ctggataaag agttactgag gcatgacagt 
gtggcagcag attttgacaa tcgaggcatg 
ctttggaaaa ggcggaatgt tgagtgcctt 
agctacagtg actcagattt aaagagggcc 
cagacagtgc ctgcccagat cttggttggc 
cattgttaca tcccacagtc tccagaacca 
acactttctt tctggagtca gtcaaagttt 
tcaccaattt tccatggaaa gatcattcca 
gcagatgttt caggctcaca cagccctggg 
cataaggatc caaaccctgc tcaccagcaa 
gttgggagcc ctggctctgt caggcagaac 
ggctccagtc ccaaagcaca gttcttggtt 
tctgaagcag cttcacactc tgcattacag 
ctggcggcca aagccctagc aaatttaaat 
aaggtagaaa tgtggcagaa agagcttaat 
ggcgagaggg accctttcat cctatgcagc 
gagcctgtaa tcaccaaaga ggatgtggac 
gaagcacttt ttttattaga gaagggacag 
atagtgaacc tgcagacaat tcccgtggat 
aaggcattca ctaaggttaa ttttgattct 
aagaaaatat ttaagcacac gctggaagaa 
cctggcctct agctttcact cgtggtgaat 
ctgttcatat gtgaataagt tgaagattgt 
tgaatgttgt tagtttgtgc tgagaatggt 
aaatatattt aagctacatt tttgttttga 
tttatttatt tgactgagtt aatattattg 
ttcttatttc tttggcattt tggaaataat 
cttgatggaa ctcttcctag agtttttacc 
tgtgtatcca ttcccccaca gatgaactaa 
tatttgttac gaagaaggct attgctacaa 
aattttttgt ttttcctttt ctttttataa 
tttcagagag atttagtttt actttaatgg 
ttgtagaaag tagacttgct ctttgtcaga 
caaaaggagg caggagcagt tctcaacaca 
tgctgagcca aatagcatct ttacagagga 
ttgctaagaa gcagggcttg cctctgtcct 
ggcaggggct ctgtgccagc cctgctctct 
tcttggagct ggaggaagtc aaaactgggc 
agcagcgtgg ccctggcccc acactcccaa 
aggctttata tttggctgct ttcttacaat 
catacaggtt gcattagccc taactggctt 
tccttgtaag ctgaaccatc aattatcagt 
cgtagatacc agaggccaag ccacagagag 
taatcagact aatttcgaac taaagaagtt 
aagactcatt tagaacaaat gaaatttctc 
acatcaaaaa tatttcagtc attatcagtc 



gcttgcaagt atgagaaccc agcccgctgg 60 
tactcatcct gggtcactga taatatactg 120 
gagaagtacc acatcattga tcagttcctc 180 
cagcgccctg gtgagcatgc tagctgtggg 240 
taccttcctg aggctttcat ggaggctggc 300 
tatggtgtag cgtctcttac tactatccta 360 
caggaaggaa aagtagctat ccattgtcat 420 
gcctgttact tagtttttgc aacgagaatg 480 
gcaaagcgac ccaattccat acaaaccaga 540 
cagtttctaa ctcctctccg caatatattc 600 
accttacctc aatatctaat tcgccagcgt 660 
ctgaaacacg tgccaaaaat tatccaccta 720 
aacaggccag tgatgatgaa ggatgtgtcc 780 
aagacaatgt ctgagatggt caccatgcag 840 
gatgtgtcca acccgcctaa ccccactgca 900 
attttctcca atgagcaaca gtttgaccct 960 
caacccctga ctcatctgaa aaggcggctc 1020 
gagaacctcc tggagcaagg ggagactcca 1080 
cacaagccca ggcagcagaa gctcataagc 1140 
gacttacaca aggaagcctt ggttcgcagc 1200 
ggaggcctgg aaggactcaa agataatggg 1260 
aaggaagcac agcagagtgg agctttctct 1320 
gagccagttt cacccagctt tgcaaatgtc. 1380 
gtgtctcact gtcagtgtaa aactcatggt 1440 
agcaggacac cccgaagccc tctggactgt 1500 
gaacatgaaa cccaggacag taaagatctg 1560 
tctgaattga gtgctgaggc aagaagaata 1620 
gaatctgtag aaaaggagga actaaaaagg 1680 
tcccgagatg gagcttggga aagaatatgt 1740 
ttgatgtggt cttgggtgga gcaactgaag 1800 
atgttggttg acaggcgagc agatgccgca 1860 
caccagacta ttctctgcgt gttgcactgc 1920 
gtggaggaag ctttccttgc ccatgccatt 1980 
gaaaatggac caacagttta caacaccctg 2040 
aaaagaaaaa tgacaaaaga tggccctaag 2100 
atttcagacc taaagatcca gatagtatct 2160 
ggggctactt tttctcatag cactttattt 2220 
cgtccgtatt tgaaccaatt atttatttta 2280 
aaaattgcca taaatttggt gccactttct 2340 
tattaacatt ttaagtatat ggtgtttaca 2400 
cataacttgt ctttccaaaa taaccatttt 2460 
aaatagctaa ctttagtagt aaaacctcat 2520 
gaaagtcacc aagtgtctta agctgtttta 2580 
tatttttaaa ggtttctttt ttaactttga 2640 
atgtaacaga gggtttcaaa gcatattatt 2700 
agtgactgtg aagtggttgg gattttttgc 2760 
tttccaaaca accttgccag ccttggctgt 2820 
ccaagcctta ttcccactcc cttgggttgc 2880 
agtgggatca gaggcaggaa gtgtggaaag 2940 
cccggggact ccacagggat attcgtgcag 3000 
cagatgccac agccactctg cagaggtgac 3060 
cactgtttgt actgatggtg tattagcatg 3120 
atctgccact ccatagaccc acttgcctca 3180. 
gagaattaag atttttaaac tgaagttgac 3240 
catgtaagaa gggtgactgc ctaaactagt 33 00 
tgaagccata cttttattta aattaatata 3360 
gataatagtt cttcccaata aaggtgatat 3420 
actgcttaaa gacggaattt caggggaagc 3480 
cagtcctaca tttctgaatt gacttctagc 3540 
tcattaactg aaatgccaaa tgctaaatgc 3600 
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agtgttcttt cacactgttt taattttctt gggaaattga gtccagtgga tgttaatgga 3660 
gtgggttgcc catccctgaa atgtcttatt ttcaagtgcc tggcctggga aagaagggga 3720 
agaaacaatt gcattatatc caaagataca ctataaaaat agagttttta ccaaaaaaag 3780 

atgtttgttc tcatctcagt aggcctcatt tgggcaagtg acccacaggt cttttggcga 3840 
gtttgctatt tgcctgttga aatacttgtt tcaacttaga gaacagttat gatgtgacca 3900 
tagcatggca caactaaaaa tctaagcctg aaacctgaaa aaagagatat gacaagggaa 3960 
attaatcagg ctatacataa gtattgtatt tatttgaata aaaataa 4007 

<210> 31 
<211> 4524 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: 71231882CB1 

<400> 31 

ccgccctcgc caacatggcg gcgcccagtt ggggcgggtt cgttcgcttc gcgttttggc 60 
cagggcgggg gtctgggctt taggcaggta gtatttagtt tcacaatgtt tggggacctg 120 
tttgaagagg agtattccac tgtgtctaat aatcagtatg gaaaagggaa gaaattaaag 180 
actaaagctt tggagccacc tgctcctaga gaattcacca atttaagcgg aatcagaaat 240 
cagggtggaa cctgttacct caattccctt cttcagactc ttcatttcac acctgaattc 300 
agagaagctc tattttctct tggcccagaa gagcttggtt tgtttgaaga taaggataaa 360 
cccgatgcaa aggttcgaat catcccttta cagttacagc gcttgtttgc tcagcttctg 420 
ctcttagacc aggaagctgc atccacagca gacctcactg acagctttgg gtggaccagt 480 
aatgaggaaa tgaggcaaca tgatgtgcag gaactgaatc gaatcctctt cagcgctttg 540 
gaaacttctt tagttgggac ctccggtcat gacctcatct atcgtctgta ccatggaacc 600 
attgttaacc agattgtttg taaagaatgt aagaacgtta gcgagaggca ggaagacttc 660 
ttagatctaa cagtagcagt caaaaatgta tccggtttgg aagatgctct ctggaacatg 720 
tatgtagaag aggaagtttt tgattgtgac aacttgtacc actgtggaac ttgtgacagg 780. 
ctggttaaag cagcaaagtc ggccaaatta cgtaagctgc ctccttttct tactgtttca 840 
ttactaagat ttaattttga ttttgtgaaa tgcgaacgct acaaggaaac tagctgttat 900 
acattccctc tccggattaa tctcaagccc ttttgtgaac agagtgaatt ggatgactta 960 
gaatatatat atgacctctt ctcagttatt atacacaaag gtggctgcta cggaggccat 1020 
taccatgtat atattaaaga tgttgatcat ttgggaaact ggcagtttca agaggaaaaa 1080 
agtaaaccag atgtgaatct gaaagatctc cagagtgaag aagagattga tcatccactg 1140 
atgattctaa aagcaatctt attagaggag gagaataatc taattcctgt tgatcagctg 1200 
ggccagaaac ttttgaaaaa gataggaata tcttggaaca agaagtacag aaaacagcat 1260 
ggaccattgc ggaagttctt acagctccat tctcagatat ttctactcag ttcagatgaa 1320 
agtacagttc gtctcttgaa gaatagttct ctccaggctg agtctgattt ccaaaggaat 1380 
gaccagcaaa ttttcaagat gcttcctcca gaatccccag gtttaaacaa tagcatctcc 1440 
tgtccccact ggtttgatat aaatgattct aaagtccagc caatcaggga aaaggatatt 1500 
gaacagcaat ttcagggtaa agaaagtgcc tacatgttgt tttatcggaa atcccagttg 1560 
cagagacccc ctgaagctcg agctaatcca agatatgggg ttccatgtca tttactgaat 1620 
gaaatggatg cagctaacat tgaactgcaa accaaaaggg cagaatgtga ttctgcaaac 1680 
aatacttttg aattgcatct tcacctgggc cctcagtatc atttcttcaa tggggctctg 1740 
cacccagtag tctctcaaac agaaagcgtg tgggatttga cctttgataa aagaaaaact 1800 
ttaggagatc tccggcagtc aatatttcag ctgttagaat tttgggaagg agacatggtt 1860 
cttagtgttg caaagcttgt accagcagga cttcacattt accagtcact tggcggggat 1920 
gaactgacac tgtgtgaaac tgaaattgct gatggggaag acatctttgt gtggaatggg 1980 
gtggaggttg gtggagtcca cattcaaact ggtattgact gcgaacctct acttttaaat 2040 
gttcttcatc tagacacaag cagtgatgga gaaaagtgtt gtcaggtgat agaatctcca 2100 
catgtctttc cagctaatgc agaagtgggc actgtcctca cagccttagc aatcccagca 2160 
ggtgtcatct tcatcaacag tgctggatgt ccaggtgggg agggttggac ggccatcccc 2220 
aaggaagaca tgaggaagac gttcagggag caagggctca gaaatggaag ctcaatttta 22S0 
attcaggatt ctcatgatga taacagcttg ttgaccaagg aagagaaatg ggtcactagt 2340 
atgaatgaga ttgactggct ccacgttaaa aatttatgcc agttagaatc tgaagagaag 2400 
caagttaaaa tatcagcaac tgttaacaca atggtgtttg atattcgaat taaagccata 2460 
aaggaattaa aattaatgaa ggaactagct gacaacagct gtttgagacc tattgataga 2520 
aatgggaagc ttctttgtcc agtgccggac agctatactt tgaaggaagc agaattgaag 2580 
atgggaagtt cattgggact gtgtcttgga aaagcaccaa gttcgtctca gttgttcctg 2640 
ttttttgcaa tggggagtga cgttcaacct gggacagaaa tggaaatcgt agtagaagaa 2700 
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acaatatctg tgagagattg tttaaagtta atgctgaaga aatctggcct acaaggagat 2760 
gcctggcatt tacgaaaaat ggattggtgc tatgaagctg gagagccttt atgtgaagaa 2820 
aattcagcca gaagccaact cattaccctt ggaactggct tctcgttcca gccttgccag 2880 
gatgcaacac tgaaagaact tctgatatgt tctggagata ctttgctttt aattgaagga 2940 
caacttcctc ctctgggttt cctgaaggtg cccatctggt ggtaccagct tcagggtccc 3000 
tcaggacact gggagagtca tcaggaccag accaactgta cttcgtcttg gggcagagtt 3060 
tggagagcca cttccagcca aggtgcttct gggaacgagc ctgcgcaagt ttctctcctc 3120 
tacttgggag acatagagat ctcagaagat gccacgctgg cggagctgaa gtctcaggcc 3180 
atgaccttgc ctcctttcct ggagttcggt gtcccgtccc cagcccacct cagagcctgg 3240- 
acggtggaga ggaagcgccc aggcaggctt ttacgaactg accggcagcc actcagggaa 3300 
tataaactag gacggagaat tgagatctgc ttagagcccc ttcagaaagg cgaaaacttg 3360 
ggcccccagg acgtgctgct gaggacacag gtgcgcatcc ctggtgagag gacctatgcc 3420 
cctgccctgg acctggtgtg gaacgcggcc cagggtggga ctgccggctc cctgaggcag 3480 
agagttgccg atttctatcg tcttcccgtg gagaagattg aaattgccaa atactttccc 3540 
gaaaagttcg agtggcttcc gatatctagc tggaaccaac aaataaccaa gaggaaaaag 3600 
aaaaaaaaac aagattattt gcaaggggca ccgtattact tgaaagacgg agatactatt 3660 
ggtgttaaga atctcctgat tgacgacgat gatgatttca gtacaatcag agatgacact 3720 
ggaaaagaaa agcagaaaca acgggccctg gggagaagga aaagccaaga agccctccat 3780 
gagcagagca gctacatcct ctccagtgca gagacgcctg cccggccccg agccccggaa 3840 
acttctctct ccatccacgt ggggagcttc agataaccgc gccgctgcac ggctctactc 3900 
ccgatgaact ctccggctga tgccacaaac gtgggtttcc tgggcatggg gactggctgc 3960 
ctggcgcctc caatcccaaa tcctctgctt cctttgagca cagggacggc tcctctgagg 4020 
cctggccagt gcatgtagtc acttagctct gcaacacgtg gcagccacgg gggctggtgc 4080 
agctctggat gtcgcccacc cagctgccag taggtgctgg gctctctcac acagcacccg 4140 
gccccagctg cctttttttt tcttttaacc agaaaatgca caacgtgtgc gtgaaccgca 4200 
ggtatggagg cagcggcatg ccgttgctcc gctgtgggag gtgtgtgggg tcaggccagc 4260 
cactttcctc cgtgttcaga tgactctcgt tcgccctgac cggcttctca cagtgtctca 4320 
ggccactgcg ccaccgcgct ggtgctgagc agaagcgggc agaagtgggg tctgctttca 4380 
ggacttcatt tcccccactc gttccggccc cgcatgctcc acgtctgccc tttggtctga 4440 
gttaaaactg cgatgctgaa aagtgcgagc tctttccacg aggaggagcc acacagggtg 4500 
gcctccgagg gtgagtcgct ctgc 4524 

<210> 32 

<211> 3250 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc„featiire 

<223> Incyte ID No: 2875922CB1 

<400> 32 

cggcggctgt ggccgcggcg aacaggccgg acgcctcgtc gctggcgggg cttcccttgg 60 
agctgacgaa ggcggggtcc gcggtccagg ctgctgccgc gacggggccg gcggcggggc 120 
agctgccagg aacgagggta gcagctgcat ccagatctca ttatgcatca gaaaaatgaa 180 
aaaacagagg aaaattctat ggaggaaagg aatccactta gccttttctg agaaatggaa 240 
tactgggttt ggaggcttta agaagtttta ttttcaccaa cacttgtgca ttctgaaagc 300 
taagctggga aggccagtta cttggaatag acagttgaga catttccagg gtagaaagaa 360 
agctcttcaa atccagaaaa cgtggatcaa ggatgaaccc ctttgtgcta agaccaagtt 420 
caatgtggct actcaaaatg ttagtacttt gtcctctaaa gtgaaaagaa aggacgctaa 480 
acacttcatt tcctcctcaa agactctcct gagactccaa gcagagaagc tgttgtcatc 540 
agcaaagaat tctgaccatg aatactgcag agagaaaaat ctcttgaagg cagttactga 600 
ctttccatca aatagtgctt taggtcaggc caatggtcac agacctagga cagacccaca 660 
accttctgac tttcccatga agttcaatgg ggagagccaa agtccaggtg agagtggcac. 720 
gattgtggtc accttgaaca accataagag aaagggcttt tgttacggct gctgccaagg 780 
gccggagcac cacaggaatg ggggaccctt gattccaaaa aagttccaac ttaaccaaca 840 
tagaaggata aaattatctc ctcttatgat gtatgagaaa ttatccatga ttagatttcg 900 
gtacaggatt ctcagatccc agcacttcag aaccaaaagc aaggtttgca agctaagaaa 960 
agcccagcga agctgggtac agaaagtcac tggggaccat caagagaccc gtagggagaa 1020 
cggtgagggt ggcagttgca gcccatttcc ttccccagaa cctaaagacc cttcttgtcg 1080 
gcatcagccg tactttccag atatggacag cagtgctgtg gtgaagggga cgaactctca 1140 
tgtgcctgat tgccacacta aaggaagctc tttcttgggc aaggagctta gtttagacga 1200 
agcattccct gaccaacaga atggcagtgc cacaaacgcc tgggaccagt catcctgttc 1260 
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ttctcctaag tgggagtgta cagagctgat tcatgacatc cccttaccag « aacatcgttc 1320 
taataccatg ttcatttcag aaactgaaag agaaattatg actctgggtc aggaaaatca 1380 
gacaagttct gtcagtgatg acagagtaaa actgtcagtg tctggagcag atacatctgt 1440 
gagtagcgta gatgggcctg tgtcccaaaa ggctgttcaa aatgagaact cataccagat 1500 
ggaggaggat ggatctctca agcagagcat tcttagttct gagttgctgg accaccctta 1560 
ctgtaaaagt ccactggagg ctcccttggt gtgcagtgga ctcaaactag aaaatcaagt 1620 
aggaggtgga aagaacagtc agaaagcctc tccagtggat gatgaacagc tgtcagtctg 1680 
tctttctgga ttcctagatg aggttatgaa gaagtatggc agtttggttc cactcagtga 1740 
aaaagaagtc cttggaagat taaaagatgt ctttaatgaa gacttttcta atagaaaacc 1800 
atttatcaat agggaaataa caaactatcg ggccagacat caaaaatgta acttccgtat 1860 
cttctataat aaacacatgc tggatatgga cgacctggcg actctggatg gtcagaactg 1920 
gctgaatgac caggtcatta atatgtatgg tgagctgata atggatgcag tcccagacaa 1980 
agttcacttc ttcaacagct tttttcatag acagctggta accaaaggat ataatggagt 2040 
aaaaagatgg actaaaaagg tggatttgtt taaaaagagt cttctgttga ttcctattca 2100 
cctggaagtc cactggtctc tcattactgt gacactctct aatcgaatta tttcatttta 2160 
tgattcccaa ggcattcatt ttaagttttg tgtagagaat ataagaaagt atttgctgac 2220 
tgaagccaga gaaaaaaata gacctgaatt tcttcagggt tggcagactg ctgttacgaa 2280 
gtgtattcca caacagaaaa acgacagtga ctgtggagtc tttgtgctcc agtactgcaa 2340 
gtgcctcgcc ttagagcagc ctttccagtt ttcacaagaa gacatgcccc gagtgcggaa 2400 
gaggatttac aaggagctat gtgagtgccg gctcatggac tgaaactcag cagggactct 2460 
gggaagtctg accaagttgg agcagatggt ttgttacttg aatctccaaa cacttagttg 2520 
aatttttaca gatatttcag atcagtggtg ttgggccact attgttacct caaatttatt 2580 
ttttgccctt attcatttct ccagctacca tgtactattg tttaatgttc agtttggttt 2640 
catttttaat tttatggttc tgtgcgtccc ccatatttaa tatttattat tcaaacgcat 2700 
gcatatagac agagcatgca gtgaagagta ttaaaaaaaa aagcttagta gatttggtgc 2760 
agcttttgaa acttagttag acgtgaactg aatacaggtt tcaaatttac tcccagaacc 2820 
taaaaatgca agatgttttt gatacaacat aactctgaga atagtaagtg • ttccctgggg 2880 
cattaagggt agctgggggt ggttttgaca aatccagtcc tgttttactt taccagcggc 2940 
aactttcacc aacttcccct ctccaagtga gtcttagaga gtgcagtcca ttccttttga 3000 
agggtgagat ggaagtggtc gtaaactgac tggtgtcttc tgtttctgga ggcacacttg 3060 
taagcacagt ggctgctttg ggaggagtaa ggtgtgagaa aaagcaacct tggaggccag 312 0 
taacaatgac agatttcaat cgtggtttta ggaattataa tacgtggcat acatctcata 3180 
aaggcttttg ctgggatatt gaattccctg aatttttctg ttttcgacct gttaaaaaaa 3240 
tcttaacatc 3250 

<210> 33 

<211> 3834 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No; 8158136CB1 

<400> 33 

gcggccggtc gtgcgggtcg ggcgcgggcg ggcgcggcgg cagtggcgcg cacaggtgat 60 
tgactggcca gctgcctgaa ggagcgccag gtcctccttg ctggcaggtg gcgaagccca 120 
ttggggcggc ggtgcagacc gcggcggcgg ctgcggcggt ctggctcggg aggcgttcct 180 
ggggccaagg ccatggcccc gcggctgcag ctggagaagg cggcctggcg ctgggcggag 240 
acggtgcggc ccgaggaggt gtcgcaggag cacatcgaga ccgcttaccg catctggctg 300 
gagccctgca ttcgcggcgt gtgcagacga aactgcaaag gaaatccgaa ttgcttggtt 360 
ggtattggtg agcatatttg gttaggagaa atagatgaaa atagttttca taacatcgat 420 
gatcccaact gtgagaggag aaaaaagaac tcatttgtgg gcctgactaa ccttggagcc 480 
acttgttatg tcaacacatt tcttcaagtg tggtttctca acttggagct tcggcaggca 540 
ctctacttat gtccaagcac ttgtagtgac tacatgctgg gagacggcat ccaagaagaa 600 
aaagattatg agcctcaaac aatttgtgag catctccagt acttgtttgc cttgttgcaa 660 
aacagtaata ggcgatacat tgatccatca ggatttgtta aagccttggg cctggacact 720 
ggacaacagc aggatgctca agaattttca aagctcttta tgtctctatt ggaagatact 780* 
ttgtctaaac aaaagaatcc agatgtgcgc aatattgttc aacagcagtt ctgtggagaa 840 
tatgcctatg taactgtttg caaccagtgt ggcagagagt ctaagctttt gtcaaaattt 900 
tatgagctgg agttaaatat ccaaggccac aaacagttaa cagattgtat ctcggaattt 960 
ttgaaggaag aaaaattaga aggagagaat cgctattttt gcgagaactg tcaaagcaaa 1020 
cagaatgcaa caagaaagat tcgacttctt agccttcctt gcactctgaa cttgcagcta 1080 
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atgcgttttg tctttgacag gcaaactgga cataagaaaa agctgaatac ctacattggc 1140 
ttctcagaaa ttttggatat ggagccttat gtggaacata aaggtgggtc ctacgtgtat 1200 
gaactcagcg cagtcctcat acacagagga gtgagtgctt attctggcca ctacatcgcc 1260 
cacgtgaaag atccacagtc tggtgaatgg tataagttta atgatgaaga catagaaaag 1320 
atggagggga agaaattaca actagggatt gaggaagatc tagcagaacc ttctaagtct 13 80 
cagacacgta aacccaagtg tggcaaagga actcattgct ctcgaaatgc atatatgttg 1440 
gtttatagac tgcaaactca agaaaagccc aacactactg ttcaagttcc agcctttctt 1500 
caagagctgg tagatcggga taattccaaa tttgaggagt ggtgtattga aatggctgag 1560 
atgcgtaagc aaagtgtgga taaaggaaaa gcaaaacacg aagaggttaa ggagctgtac 1620 
caaaggttac ctgctggagc tgagccctat gagtttgtct ctctggaatg gctgcaaaag 1680 
tggttggatg aatcaacacc taccaaacct attgataatc acgcttgcct gtgttcccat 1740 
gacaagcttc acccggataa aatatcaatt atgaagagga tatctgaata tgcagctgac 1800 
attttctata gtagatatgg aggaggtcca agactaactg tgaaagccct gtgtaaggaa 1860 
tgtgtagtag aacgttgtcg catattgcgt ctgaagaacc aactaaatga agattataaa 1920 
actgttaata atctgctgaa agcagcagta aagggcgatg gattttgggt ggggaagtcc 1980 
tccttgcgga gttggcgcca gctagctctt gaacagctgg atgagcaaga tggtgatgca 2040 
gaacaaagca acggaaagat gaacggtagc accttaaata aagatgaatc aaaggaagaa 2100 
agaaaagaag aggaggaatt aaattttaat gaagatattc tgtgtccaca tggtgagtta 2160 
tgcatatctg aaaatgaaag aaggcttgtt tctaaagagg cttggagcaa actgcagcag 2220 
tactttccaa aggctcctga gtttccaagt tacaaagagt gctgttcaca gtgcaagatt 2280 
ttagaaagag aaggggaaga aaatgaagcc ttacataaga tgattgcaaa cgagcaaaag 2340 
acttctctcc caaatttgtt ccaggataaa aacagaccgt gtctcagtaa ctggccagag 2400 
gatacggatg tcctctacat cgtgtctcag ttctttgtag aagagtggcg gaaatttgtt 2460 
agaaagccta caagatgcag ccctgtgtca tcagttggga acagtgctct tttgtgtccc 2520 
cacgggggcc tcatgtttac atttgcttcc atgaccaaag aagattctaa acttatagct 2580 
ctcatatggc ccagtgagtg gcaaatgata caaaagctct ttgttgtgga tcatgtaatt 2640 
aaaatcacga gaattgaagt gggagatgta aacccttcag aaacacagta tatttctgag 2700 
cccaaactct gtccagaatg cagagaaggc ttattgtgtc agcagcagag ggacctgcgt 2760 
gaatacactc aagccaccat ctatgtccat aaagttgtgg ataataaaaa ggtgatgaag 2820 
gattcggctc cggaactgaa tgtgagtagt tctgaaacag aggaggacaa ggaagaagct 2880 
aaaccagatg gagaaaaaga tccagatttt aatcaaagca atggtggaac aaagcggcaa 2940 
aagatatccc atcaaaatta tatagcctat caaaagcaag ttattcgccg aagtatgcga 3000 
catagaaaag ttcgtggtga gaaagcactt ctcgtttctg ctaatcagac gttaaaagaa 3060 
ttgaaaattc agatcatgca tgcattttca gttgctcctt ttgaccagaa tttgtcaatt 3120 
gatggaaaga ttttaagtga tgactgtgcc accctaggca cccttggcgt cattcctgaa 3180 
tctgtcattt tattgaaggc tgatgaacca attgcagatt atgctgcaat ggatgatgtc 3240 
atgcaagttt gtatgccaga agaagggttt aaaggtactg gtcttcttgg acattaatct 3300 
ttgaatactt gctgactgct aagaaatgac cagaggggaa gaggagtttg acatgttagg 3360 
gcattaaagc aaaggtggat ttaagaatta aaccattaca tgccccttcc aaaaggcaga 3420 
aatccattca aacgtgactg tcccaaatgc cttatgtcaa ataaagcaga ttgcactgat 3480 
ggacatcaga cttgaaggaa atgtttccaa ttttatattt aaggggggtg gtgggtggga 3540 
gggggcaagt aaagacggaa caagtttagt agcagtaata gtaaatcatg tttacatatg 3600 
agatttatag tcgtgggagg ggaataaagt tctgttatat ttccttgctc gagtttcata 3660 
ccagatgcgt tggtccataa aggattgtat caagtagatg ggacaacatt ctgctctgaa 3720 
cgaaaagtaa ttttagagac ataacctgct taccaatgcc tgtctttgat tcatattcta 3780 
ctttcaataa agcatgaaag tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 3834 

.<210> 34 

<211> 4493 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 5969491CB1 

<400> 34 

gcgcccagtt ggggcgggtt cgttcgcttc gcgtttggcc aggcgggggt ctgggcttta 60 
ggcaggtagt atttagtttc acaatgtttg gggacctgtt tgaagaggag tattccactg 120 
tgtctaataa tcagtatgga aaagggaaga aattaaagac taaagctttg gagccacctg 180 
ctcctagaga attcaccaat ttaagcggaa tcagaaatca gggtggaacc tgttacctca 240 
attcccttct tcagactctt catttcacac ctgaattcag agaagctcta ttttctcttg 300 
gcccagaaga gcttggtttg tttgaagata aggataaacc cgatgcaaag gttcgaatca 360 
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tccctttaca gttacagcgc ttgtttgctc 
ccacagcaga cctcactgac agctttgggt 
atgtgcagga actgaatcga atcctcttca 
ccggtcatga cctcatctat cgtctgtacc 
aagaatgtaa gaacgttagc gagaggcagg 
aaaatgtatc cggtttggaa gatgctctct 
attgtgacaa cttgtaccac tgtggaactt 
ccaaattacg taagctgcct ccttttctta 
ttgtgaaatg cgaacgctac aaggaaacta 
tcaagccctt ttgtgaacag agtgaattgg 
cagttattat acacaaaggt ggctgctacg 
ttgatcattt gggaaactgg cagtttcaag 
aagatctcca gagtgaagaa gagattgatc 
tagaggagga gaataatcta attcctgttg 
taggaatatc ttggaacaag aagtacagaa 
agctccattc tcagatattt ctactcagtt 
atagttctct ccaggctgag tctgatttcc 
ttcctccaga atccccaggt ttaaacaata 
atgattctaa agtccagcca atcagggaaa 
aaagtgccta catgttgttt ^tatcggaaat 
ctaatccaag atatggggtt ccatgtcatt 
aactgcaaac caaaagggca gaatgtgatt 
acctgggccc tcagtatcat ttcttcaatg 
aaagcgtgtg ggatttgacc tttgataaaa 
tatttcagct gttagaattt tgggaaggag 
cagcaggact tcacatttac cagtcacttg 
aaattgctga tggggaagac atctttgtgt 
ttcaaactgg tattgactgc gaacctctac 
gtgatggaga aaagtgttgt caggtgatag 
aagtgggcac tgtcctcaca gccttagcaa 
ctggatgtcc aggtggggag ggttggacgg 
tcagggagca agggctcaga aatggaagct 
acagcttgtt gaccaaggaa gagaaatggg 
acgttaaaaa tttatgccag ttagaatctg 
ttaacacaat ggtgtttgat attcgaatta 
aactagctga caacagctgt ttgagaccta 
tgccggacag ctatactttg aaggaagcag 
gtcttggaaa agcaccaagt tcgtctcagt 
ttcaacctgg gacagaaatg gaaatcgtag 
taaagttaat gctgaagaaa tctggcctac 
attggtgcta tgaagctgga gagcctttat 
tgatatgttc tggagatact ttgcttttaa 
tgaaggtgcc catctggtgg taccagcttc 
aggaccagac caactgtact tcgtcttggg 
gtgcttctgg gaacgagcct gcgcaagttt 
cagaagatgc cacgctggcg gagctgaagt 
agttcggtgt cccgtcccca gcccacetca 
gcaggctttt acgaactgac cggcagccac 
agatctgctt agagcccctt cagaaaggcg 
ggacacaggt gcgcatccct ggtgagagga 
acgcggccca gggtgggact gccggctccc 
ttcccgtgga gaagattgaa attgccaaat 
tatctagctg gaaccaacaa ataaccaaga 
aaggggcacc gtattacttg aaagacggag 
acgacgatga tgatttcagt acaatcagag 
gggccctggg gagaaggaaa agccaagaag 
ccagtgcaga gacgcctgcc cggccccgag 
ggagcttcag ataaccgcgc cgctgcacgg 
ccacaaacgt gggtttcctg ggcatgggga 
ctctgcttcc tttgagcaca gggacggctc 
ttagctctgc aacacgtggc agccacgggg 
gctgccagta ggtgctgggc tctctcacac 
ttttaaccag aaaatgcaca acgtgtgcgt 



agcttctgct cttagaccag gaagctgcat 420 
ggaccagtaa tgaggaaatg aggcaacatg 480 
gcgctttgga aacttcttta gttgggacct 540 
atggaaccat tgttaaccag attgtttgta 600 
aagacttctt agatctaaca gtagcagtca 660 
ggaacatgta tgtagaagag gaagtttttg 720 
gtgacaggct ggttaaagca gcaaagtcgg 780 
ctgtttcatt actaagattt aattttgatt 840 
gctgttatac attccctctc cggattaatc 900 
atgacttaga atatatatat gacctcttct 960 
gaggccatta ccatgtatat attaaagatg 1020 
aggaaaaaag taaaccagafc gtgaatctga 1080 
atccactgat gattctaaaa gcaatcttat 1140 
atcagctggg ccagaaactt ttgaaaaaga 1200 
aacagcatgg accattgcgg aagttcttac 1260 
cagatgaaag tacagttcgt ctcttgaaga 1320 
aaaggaatga ccagcaaatt ttcaagatgc 1380 
gcatctcctg tccccactgg tttgatataa 1440 
aggatattga acagcaattt cagggtaaag 1500 
cccagttgca gagaccccct gaagctcgag 1560 
tactgaatga aatggatgca gctaacattg 1620 
ctgcaaacaa tacttttgaa ttgcatcttc 1680 
gggctctgca cccagtagtc tctcaaacag 1740 
gaaaaacttt aggagatctc cggcagtcaa 1800 
acatggttct tagtgttgca aagcttgtac 1860 
gcggggatga actgacactg tgtgaaactg 1920 
ggaatggggt ggaggttggt ggagtccaca 1980 
ttttaaatgt tcttcatcta gacacaagca 2040 
aatctccaca tgtctttcca gctaatgcag 2100 
tcccagcagg tgtcatcttc atcaacagtg 2160 
ccatccccaa ggaagacatg aggaagacgt 2220 
caattttaat tcaggattct catgatgata 2280 
tcactagtat gaatgagatt gactggctcc 2340 
aagagaagca agttaaaata tcagcaactg 2400 
aagccataaa ggaattaaaa ttaatgaagg 2460 
ttgatagaaa tgggaagctt ctttgtccag 2520 
aattgaagat gggaagttca ttgggactgt 2580 
tgttcctgtt ttttgcaatg gggagtgacg 2640 
tagaagaaac aatatctgtg agagattgtt 2700 
aaggagatgc ctggcattta cgaaaaatgg 2760 
gtgaagaaga tgcaacactg aaagaacttc 2820 
ttgaaggaca acttcctcct ctgggtttcc 2880 
agggtccctc aggacactgg gagagtcatc 2940 
gcagagtttg gagagccact tccagccaag 3000 
ctctcctcta cttgggagac atagagatct 3060 
ctcaggccat gaccttgcct cctttcctgg 3120 
gagcctggac ggtggagagg aagcgcccag 3180 
tcagggaata taaactagga cggagaattg 3240 
aaaacttggg cccccaggac gtgctgctga 3300 
cctatgcccc tgccctggac ctggtgtgga 3360 
tgaggcagag agttgccgat ttctatcgtc 3420 
actttcccga aaagttcgag tggcttccga 3480 
ggaaaaagaa aaaaaaacaa gattatttgc 3540 
atactattgg tgttaagaat ctcctgattg 3600 
atgacactgg aaaagaaaag cagaaacaac 3660 
ccctccatga gcagagcagc tacatcctct 3720 
ccccggaaac ttctctctcc atccacgtgg 3.780 
ctctactccc gatgaactct ccggctgatg 3840 
ctggctgcct ggcgcctcca atcccaaatc 3900 
ctctgaggcc tggccagtgc atgtagtcac 3960 
gctggtgcag ctctggatgt cgcccaccca 4020 
agcacccggc cccagctgcc tttttttttc 4080 
gaaccgcagg tatggaggca gcggcatgcc 4140 
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gttgctccgc tgtgggaggt gtgtggggtc aggccagcca ctttcctccg tgttcagatg 4200 
actctcgttc gccctgaccg gcttctcaca gtgtctcagg ccactgcgcc accgcgctgg 4260 
tgctgagcag aagcgggcag aagtggggtc tgctttcagg acttcatttc ccccactcgt 4320 
tccggccccc gcatgctcca cgtctgccct ttggtctgag ttaaaactgc gatgctgaaa 4380 
agtgcgagct ctttccacga ggaggagcca cacagggtgg cctccgaggg tgagtcgctc 4440 
tgctaagcaa gggcagccgc tgcacgtcag cccgcaggcc aagggtccag ctt 4493 

<210> 35 

<211> 2921 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 74973 67CB1 

<400> 35 

ctggggctgg attgagctga ccacaggcca caccagactc ctctctgctc ctgaggaaga 60 
cagggcagcc cggcgccacc cgctcggccc tcacgaagat gctccctgga gcctggctgc 120 
tctggacctc cctcctgctc ctggccaggc ctgcccagcc ctgtcccatg ggttgtgact 180 
gcttcgtcca ggaggtgttc tgctcagatg aggagcttgc caccgtcccg ctggacatcc 240 
cgccatatac gaaaaacatc atctttgtgg agacctcgtt caccacattg gaaaccagag 300 
cttttggcag taaccccaac ttgaccaagg tggtcttcct caacactcag ctctgccagt 360 
ttaggccgga tgcctttggg gggctgccca ggctggagga cctggaggtc acaggcagta 420 
gcttcttgaa cctcagcacc aacatcttct ccaacctgac ctcgctgggc aagctcaccc 480 
tcaacttcaa catgctggag gctctgcccg agggtctttt ccagcacctg gctgccctgg 540 
agtccctcca cctgcagggg aaccagctcc aggccctgcc caggaggctc ttccagcctc 600 
tgacccatct gaagacactc aacctggccc agaacctcct ggcccagctc ccggaggagc 660 
tgttccaccc actcaccagc ctgcagaccc tgaagctgag caacaacgcg ctctctggtc 720 
tcccccaggg tgtgtttggc aaactgggca gcctgcagga gctcttcctg gacagcaaca 780 
acatctcgga gctgccccct caggtgttct cccagctctt ctgcctagag aggctgtggc 840 
tgcaacgcaa cgccatcacg cacctgccgc tctccatctt tgcctccctg ggtaatctga 900 
cctttctgag cttgcagtgg aacatgcttc gggtcctgcc tgccggcctc tttgcccaca 960 
ccccatgcct ggttggcctg tctctgaccc ataaccagct ggagactgtc gctgagggca 1020 
cctttgccca cctgtccaac ctgcgttccc tcatgctctc atacaatgcc attacccacc 1080 
tcccagctgg catcttcaga gacctggagg agttggtcaa actctacctg ggcagcaaca 1140 
accttacggc gctgcaccca gccctcttcc agaacctgtc caagctggag ctgctcagcc 1200 
tctccaagaa ccagctgacc acacttccgg agggcatctt cgacaccaac tacaacctgt 1260 
tcaacctggc cctgcacggt aacccctggc agtgcgactg ccacctggcc tacctcttca 1320 
actggctgca gcagtacacc gatcggctcc tgaacatcca gacctactgc gctggccctg 1380 
cctacctcaa aggccaggtg gtgcccgcct tgaatgagaa gcagctggtg tgtcccgtca 1440 
cccgggacca cttgggcttc caggtcacgt ggccggacga aagcaaggca gggggcagct .1500 
gggatctggc tgtgcaggaa agggcagccc ggagccagtg cacctacagc aaccccgagg 1560 
gcaccgtggt gctcgcctgt gaccaggccc agtgtcgctg gctgaacgtc cagctctctc 1620 
ctcggcaggg ctccctggga ctgcagtaca atgctagtca ggagtgggac ctgaggtcga 1680 
gctgcggttc tctgcggctc accgtgtcta tcgaggctcg ggcagcaggg ccctagtagc 1740 
agcgcataca ggagctgggg aagggggcct ctggggcctg accaggcgac aggtaggggc 1800 
ggaggggagc tgagtctccg aagccttggc ttttcacatg caagggacag ggttacatcc 1860 
ccaaggtgag ggggtggagt ctggtctgct ccactaacca gggtctcctc ctcctcttcc 1920 
ttcatcgctt ctcctggagt gtgcggccta acaaggccat ccttatgctt tgcaaagcac 1980 
cctcaaaagc tgcaccacag cctggagaat aaaatatcct cagccctgat gcctccccat 2040 
tatgtaacac ccaaccgctc tcacctacac cctgaggtct attcactgca tcccagtgat 2100 
acaaagtgga ggccactgcc ttctgacatc tggctcaaaa gcccagtgtc tgtttccatt 2160 
tatttccctg gaatttcatt taaaattggt atagagaaaa aaaggatgtg acagaagcag 2220 
agatgaccag aaagcacagg ggcagggttc tgactggcgt gtgggagacc ctgtggccgg 2280 
cacccacctc cacacgagga ctaagctctg atttttttat cttgcccaaa ttcctaccta[ 2340 
aggggtctag ggagtcgcgc cttacaaatc ataaattctc atcagatggg ttttatttga 2400 
ccctgtatat catgacttat ttttaatctg actatggcat aacattacaa gacgaggcaa 2460 
aaatatttaa cccccaaata tatttctttg ccctaccttg aacttgccct gcagagtctc 2520 
ttgtgaggag aatccacatc ctataaagaa gcccctttcc cctttgtttt ccttcctttc 2580 
tttccagtcc aggagatcat caactaagag ccaggcaccc cttttaagtc gataagaaac 2640 
agtttacaac ctgctctctc tctctctgaa gtctgctgag agcttcccct gcacaataaa 2700 
acttggcctc cacaatcctt tatcttaacc tgaacattcc tttccattga tccoaggtct 2760 



44/59 



wo 03/000844 



PCT/US02/19360 



tcctcaacac tcagctctgc cagtttaggc cggatgcctt tggggggctg cccaggctgg 2820 
aggacctgga ggtcacaggc agtagcttct tgaacctcca ggtcctccag tttaggccgg 2880 
atgcctttgg ggggctgccc aggctggagg acctggaggt c 2921 

<210> 36 

<211> 2572 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7632424CB1 



<400> 36 

tagtgaccta 

ggcagtcgac 

aaggaattat 

aaccatcaaa 

tatttcttca 

tggtggatga 

taaaaacagg 

gaatagtaca 

caactaccat 

agctggtagg 

.ttactattag 

gtcactgtct 

tggacatagc 

atccagagtg 

atagatttca 

ttactggact 

ttgagaaggg 

tggcatgtca 

tgttacctca 

tccgcagagc 

ccagactatg 

ctggcttggg 

gagtagacat 

agatatgatt 

gagtacagta 

cctgccttag 

ttatttctat 

ctgaattcaa 

caccatgcct 

atttgtaata 

actttgaatc 

gttataccac 

aaatcctaat 

aggaaaatct 

ttatttgcat 

tttcattcta 

ttcccaagtt 

attgtagatg 

agtcccatga 

ggttttaatt 

gagagcccta 

actaaaacat 

aatttgtggg 



tagaacacta 
agtctttagt 
ctatagagta 
aaggaaagtt 
taaaatagta 
aactggaaat 
tgggattgtt 
agaagctctt 
aaaaccagga 
acagttaaaa 
gttgaccaat 
gttggcatta 
accaggtgac 
ctccaccatg 
ttttgacatc 
tcaacacgtt 
gcaaatcctg 
tcttgtgaaa 
aaataatgca 
tctggaaatt 
cactgataat 
cattttacat 
atcaaaagaa 
tctgctgttc 
gccagatcac 
cctcttgaat 
tttttgtaga 
gtgatcctcc 
agccaaaatg 
tttattacta 
aggcttcttg 
tcatgtcagt 
tttttgaaac 
gcttagtatc 
attaccattg 
aaatgtttta 
ggttcaaaca 
aaaaagaaaa 
agttagttac 
ttcttcggtg 
ggtatagtct 
actaagagaa 
gaccactata 



ttagtcgcat 
agggaaagga 
agtatgctaa 
tatgaatttt 
ttgggaattg 
gtgttgggag 
cctccagcag 
tctgccagtg 
cttgctttaa 
aagccattca 
aaagtagaat 
gttcaaggag 
atgcttgaca 
agtggtggga 
aaacctccct 
actgataaaa 
tcttcagcag 
agaacacatc 
gtactggttg 
ttaacaaacg 
ggcattatga 
gacatagaag 
gttggagaag 
aaaaaagtcc 
aactcactgc 
agccgggact 
gacaggctct 
caccttggcc 
tttcttaagg 
aatatctcaa 
cacaccaatt 
caataaat.tt 
cataactctg 
gagatcaaga 
ttaaaaataa 
ttagccttct 
cactattctt 
aatattatgg 
aggaggtaat 
ctccttccct 
cattacttct 
cagatcagta 
tggtttgcta 



gccgcgtcgt 
gacaagtgct 
tcttgactaa 
taagaagttt 
aaactagttg 
aagcaataca 
ctcaacagct 
gagtctctcc 
gcctgggagt 
ttcccattca 
ttcctttttt 
tttcagattt 
aggtggcaag 
aagccataga 
tgcatcatgc 
taataatgaa 
cagacattgc 
gggctattct 
catctggtgg 
caacacagtg 
ttgcatggaa 
gcatccgcta 
cttccataaa 
ctaaagggtc 
aaccctgact 
acaggtgtgc 
tgccatgttg 
tccagaagtg 
tatacatttt 
agtattacaa 
taaaaatgtt 
taaggtttaa 
acctgacact 
acttccattc 
aaaaattttt 
gcaggctctg 
ggtcttgata 
gaatccttgg 
aaagtcaaag 
aaaaattagc 
acttctatgt 
cttccagaca 
gcttgttatg 



aagctcggtg 
agctactgcc 
gactgcagga 
taattttcat 
tgatgataca 
ttcccaaact 
tcacagagaa 
aagtgacctc 
gggcttatca 
tcatatggag 
agttcttttg 
tctgcttctt 
aagactttct 
acatttggcc 
taaaaattgt 
aaaggaaaaa 
tgccacagta 
gttttgtaag 
tgtcgcaagt 
cactttgttg 
tggtattgaa 
tgaaccaaaa 
agtaccacaa 
tcactctctg 
tcctgaactc 
atgtccatgc 
cccgggctgg 
ctgggattat 
gggtcttaga 
taaatgttac 
aactcttgat 
gtgcaggcct 
aaattcctgt 
aaaaagatta 
aaaagatggc 
ggttggcttt 
catttgtgat 
tcttttttgt 
aatcttttgg 
cacaattaag 
agttctattt 
tttagatgtc 
tcaagttagc 



cagcgataag 
gcccaagtgg 
gtttttttta 
cctggaacac 
gcagctgctg 
gaagttcatt 
aatattcaac 
tcagcaattg 
tttagcttac 
gctcatgcac 
atttctggag 
ggaaagtctt 
ttaataaaac 
aaacaaggaa 
gatttttctt 
gaggaaggta 
cagcacacaa 
cagagagact 
aacttctata 
tgtcctcctc 
agactacgtg 
tgtcctcttg 
ttaaaaatgg 
acctcagctg 
aagaaatcct 
ccagccaact 
tcctgaactg 
gggtgtgagc 
agacttatac^ 
catgtgagct 
atatacacta 
ttgtttacag 
agacatgcta 
ttcagttatg 
tcaggaatcc 
ctggacatgt 
atctgatgta 
tgctttgaag 
ttgaatttta 
tgtctctcct 
tctttacata 
aggggccaaa 
ac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840. 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2572 



<210> 37 
<211> 3758 
<212> DNA • 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 1804436CB1 

<400> 37 

ggagctctgg agcctttgct tcctcaaata 
caggccgagt gctggcgagg cgcgcagttc 
gaagatactg ctataatttg aaaatttgaa 
cttcaagcca tcatgagctg taagaagcag 
tgtaatatga aaatcgagca ctatttttct 
agtacttctc taatgaggat ggagtctaga 
gctcaaagat tccattcacc taagaaaaat 
acaatatatg ttaccttgaa ggtaaaccac 
acacatagtg agaatagtag cttatatatg 
gagatagaaa ctcaccaagg ccaagaaatg 
tacataaacc ttggaatgcc cctcagttgt 
ttttcccaaa gtaaaagtaa gcagaaggaa 
gcatcgactg aatgtgtcaa att'ttacatt 
aggattgtta aatgtgggaa gcttcacaaa 
aaaggagaaa ccatcaagga tgcactgtgc 
aatgatgatt ggaaactcat tgaaaacaat 
gatgaattag aaggcagata ctttcaggtt 
gcagcttctc agaatcctga gtcagagaaa 
gtggctcagt accccagttt gaaaagagaa 
aaaatgaaag taaaaaatgg ggaaacatta 
gtaacaaaaa attcttcttc gattaaagta 
gttgggtact tattctggga cagtgcaact 
ggattgttca ttttaacttg tcggcatgta 
ccaagtaagt gggcaaccat aattggtcaa 
ctaaaagaca aggaaacaaa ctactttttt 
gagcttgact atgctgtcct gaaactgaag 
tataatggaa ttactcctgt gccacttagt 
ggagaaaaaa agcagattga tgcttgtgct 
tgtcaggaac gtgttcagtc taaaaaagca 
caaagaagtt tccagaaaat agttcacaac 
ttctttgggg cttccggctc ccctgtgttt 
gctgctggct ttgcttatac ttaccaaaat 
accatggaat ccatcctcct tgatattaag 
tttgtaaatc agcaggatgt agaaatgatg 
ctggatttaa gggaatggct tatggagttg 
aaactccaaa atggtcatct tatcaataat 
catttttcta agcacatgaa gaaattagtc 
gcccaaattt tttttttttt tttgagactg 
agtggtgcga tctcagctca ctgcaacttc 
tcagtctcct gagcagctgg gattacaggc 
tttttttttg tatttttagt agagacaggg 
ctcctgacct cgtgatccac ctgcctcggc 
ccactgcacc tggctaactt gccctatttt 
tatatagtaa tcagtttcat gacactaaaa 
catatctttg cctaaattca tttgctacag 
ttccctacat ctgacataag gaaagtaagt 
caaaagttgg ggctgcaatt aatgctaaca 
tcaaaacctt ttagaatttc caaaatcttc 
cctgaaattc agaagatgat agtcactctt 
tatcttaaac atcatattac tttatttaga 
atttctcatt gttttacttc ttcatggtat 
cctgcaaatc tccctcttgt ggaattccac 
aatactacta aagccataca atatcaagca 
aagaagcagg catgtaagat tttaaggact 
cctgttgcat tgtaagagtg atgtcatctt 
gactcctgca taccaagata ttctgcagca 
aacccctcat aaagatgctt atctaacctc 
tgagatatga ctagctacac gttttgccaa 



cgagcgggaa ctgcgttgag cgctggattc 60 
tctgctgttt aaaaagtatc cctgtgcttt 120 
attagtgttt cagctgaacc atccgttcat 180 
aggtcacgga agcactcagt caatgaaaaa 240 
ccggtctcta aagagcaaca gaataattgc 300 
ggagacccaa gagccacaac taatacccag 360 
ccagaagacc agaccatgcc ccaaaatagg 420 
aggagaaacc aagatatgaa acttaagctc 480 
gctctcaaca ctctccaggc tgtcagaaaa 540 
cttgtgcgtg gcacagaagg aatcaaagag 600 
ttccctgaag gtggccaggt ggtcattaca 660 
gataaccaca tatttggcag gcaggacaaa 720 
catgcaattg gaattgggaa gtgtaaaaga 780 
aaggggcgca aactctgtgt ttatgctttc 840 
aaggatggca gatttctttc ctttctggag 900 
gacaccattt tagaaagcac ccagccagtt 960 
gaggttgaga aaagaatggt ccccagtgca 1020 
agaaacacct gtgtgttgag agaacaaatc 1080 
agtgaaaaaa tcattgaaaa cttcaagaaa 1140 
tttgaattgc atagaacaac gtttgggaaa 1200 
gtgaaacttc ttgtacgtct cagtgactca 1260 
acgggttacg ccacctgctt tgtttttaaa 1320 
atagatagca ttgtgggaga cggaatagag 1380 
tgtgtaaggg tgacatttgg ttatgaagag 1440 
gttgaacctt ggtttgagat acataatgaa 1500 
gaaaatggac aacaagtacc tatggaacta 1560 
gggttgatac atattattgg ccatccatat 1620 
gtgatccctc agggtcagcg agcaaagaaa 1680 
gaaagtccag agtatgtcca tatgtatact 1740 
cctgatgtga ttacctatga cactgaattt 1800 
gattcaaaag gttcattggt ggccatgcat 1860 
gagactcgta gtatcattga gtttggctct 1920 . 
caaagacata aaccatggta tgaagaagta 1980 
agtgatgagg acttgtgaga attcagtcta 2040 
ttatttcata ggcattgaaa atggttttct 2100 
aataatattg acaatttcct atctgccagg 2160 
ctaacaacac tatgagatgg actataactt 2220 
agtctcactc tgtcgcctgg gctggagtac 2280 
cacctcccag gttcaagcga ttcttatgcc 2340 
aaacgccacc acacccagct aaaatttttt 2400 
tttcaccatg ttggtcaggc gggtctcgaa 2460 
cttccaaagt gctgggatta caagtttgag 2520 
aaagtcaagc aatgggaaga ataacaagat 2580 
gtcatatagt catagggttt tttcatcttt 2640 
tgcaggaacc aaaacttgtt catctcatga 2700 
gctcagaaaa atgtgcaggt caataagttg 2760 
taagagctaa atgcttgatt agaaatgatc 2820 
atattactga aactgtcgga atatatgggt 2880 
cccatattta taggctatta aggcaaggga 2940 
tttctactac tccaattatt aatgttatgt 3000 
tatgaagact atatagatga ttcaaccaag 3060 
tggacccaat ctgttttcca tttccattgc 3120 
ccctccctct aggtccaggg actatcacag 3180 
ggtttcgagg ggtcgagtgt aggaaaacag 3240 
gaagagcagc tggcatgatg actgctgttt 33 00 
atgtctttaa acagtgccgg tagtacagat 3360 
cccagtgttc aggtgtttca caagaaagtc 3420 
aaatgcttgt tatataaagg gtacttttgg 3480 



46759 



wo 03/000844 



PCT/US02/19360 



gagggtgagt gccgccattt agtggctgct agaaacattg cttctgtttg taagttccta 3540 
ttaaatgttt ctttctgaga aaaaaaagta tatgttttaa aattgacaag gtgcagtcaa 3 600 
tttctaggaa caggtgacca ctttttcaga ga tgaagtgt ttatattaat taaggagcac 3 660 
ttggtttctg tatctaataa tagaactgac ttagaagtag cagtaggtga tctccctcta 3720 
aagtccgggg gttctgcgcc tgggatctgc cacgagct 3758 

<210> 38 

<211> 1036 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7486358CB1 



<400> 38 

cgggcggcgg agccccggcg ccagagcatc gggttgctga tacttaggca ggtgctgtgg 60 

agccttccta tgagtcccca tgcagccatg ccttctgttc actcacagca cttttactcc 120 

tggcagctcc tccttttagt ccttgaccca tctggttttg tttccccacc ccactggcat 180 

gtccagggat caccttccac tcacacctct tgtacttcta ttcttatttc agggactgat 240 

cctgggacac ccgaggaagt gggtgctctc ctctagccca ctgaggcact agggagcagt 3 00 

ggtggctgga cctcatgccc tgctcttgcc cactggggtc ccttatcttt ctaacccacc 3 60 

cccatggttg ctgttacttc tccctgaaca agagatgatt gaaacagctt tgtctggagg 420 

aggcgaagcc acatttggga gggacagatt ataatgccag agatgaggcc tggagggagg 480 

agtttccacc tgagctcaag agtttaaggg ccagaactgt gtgcagtagg ctcctgtggg 540 

agtccaggct gggaaggctg gaggttagga gtgcttcctg tgccctgcgc catgtggagt 600 

ctgccgccga gcagggctct gtcctgtgcg ccactgctgc ttctcttcag cttccagttc 660 

ctggttacct atgcttggcg tttccaagag gaagaggagt ggaatgacca aaaacaaatt 720 

gctgtttatc tccctcccac cctggagttt gccgtgtaca cattcaacaa gcagagcaag 780 

gactggtatg cctacaagct ggtgcctgtc ctggcttcct ggaaggagca gggttatgat 840 

aagatgacat tctccatgaa tctgcaactg ggcagaacca tgtgtgggaa atttgaagat 900 

gacattgaca actgcccttt tcaagagagc ccagagctga acaatacctg cacctgcttc 960 

ttcaccattg gaatagagcc ctggaggaca cggtttgacc tctggaacaa gacgtgctca 1020 

ggcgggcatt cctgag 1036 



<210> 39 

<211> 3651 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7472344CB1 

<400> 39 

ggaggcgctg cgagcggagc cgcgcggagg 
ttgccgcctt cggggctcca gtccgcgcgc 
aggtgggcct cggctcggga cgccgggagt 
ggcgctgcgc gcccgggcgc tgtacgactt 
gcgagagcac gaggtgctga gcctgtgcag 
ggtcaacagc cgcggcgacc gcggcctctt 
ccccgagcct ggcccggcgg gagacggcgg 
gccccccggg ggcttcgagc ccctgcctgt 
tgacgccttc caggcgctgc tgcagccaca 
gcccggcgcg ggcttcccgt acggcggggg 
cggcggctac caggccagcc aaggcagcga 
ctccacggtg gcggacgagc cgggcgctct 
ctcgtcttcg gcgggtgtgg gcgcagccgg 
gtccctgggc tcccgcggcg gctcggtccc 
ctcggccacc gtgagccgca acctcaatcg 
ggccttcgtg ctgggggagg cgtcaggctt 
gctggggccc tatggccccg agtggcagga 
cgaccccacc aagcagacca agttcaaggg 



gcgcgaccgg ctggtccggg cagcgggggt 60 
cagtgctcga tgcagtaccg cgggcccctc 120 
cgggaccgcc agtcggggcg ccgggaccat 180 
caggtcggag aaccccggag agatctcgct 240 
cgagcaggac atcgagggct ggctcgaggg 300 
cccggcctcc tatgtgcagg tgatccgcgc 3 60 
cccgggcgcc ccggcccgct acgccaatgt 420 
cgcgcccccc gcctccttca agccgccgcc 480 
gcaggcgccg cctccgagca ccttccagcc 540 
cgccctgcag ccgtcgcctc agcagctcta 600 
tgatgactgg gacgacgagt gggacgacag 660 
gggcagcgga gcatacccgg acctcgacgg 720 
ccgctaccgc ctgtccacgc gctccgacct 780 
cccgcagcac cacccgtcgg ggcccaagag 840 
cttctccacc ttcgtcaagt ccggcgggga 900 
cgtgaaggac ggggacaagc tgtgcgtggt 960 
gaacccctac ccgttccagt gcaccatcga 1020 
catgaagagc tacatctcct acaagctggt 1080 
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gcccacgcac acgcaggtgc cggtgcatcg gcgctacaag cacttcgact ggctgtacgc 1140 
gcgcctggcg gagaagttcc cggtcatctc cgtgccccac ctgcccgaga agcaggccac 1200 
cggccgcttc gaggaggact tcatctctaa gcgcaggaag ggcctgatct ggtggatgaa 1260 
ccacatggcc agccacccag tgctggcgca gtgcgacgtc ttccagcact tcctgacgtg 1320 
ccccagcagc accgacgaga aagcctggaa gcagggcaag aggaaggccg agaaggacga 1380 
gatggtgggc gccaacttct tcctgaccct tagcacgccc cccgccgctg cccttgacct 1440 
gcaggaggtg gagagcaaga tagacggctt caagtgcttc accaagaaga tggacgacag 1500 
cgcgctgcag ctcaaccaca cggccaacga gttcgcgcgc aagcaggtga ccggcttcaa 1560 
aaaggagtat cagaaggtgg gccagtcctt ccgcggcctc agccaggcct ttgagctgga 1620 
ccagcaggcc ttctcggtgg gcctgaacca ggctatcgcc ttcaccggag atgcctatga 1680 
cgccattggc gagctcttcg cggagcagcc caggcaggac ctggatcccg tcatggacct 1740 
attagcgctg tatcaggggc atctggctaa cttcccggac atcatccacg ttcagaaagg 1800 
agctcttacc aaagtcaagg agagtaggcg acacgtggag gaagggaaga tggaggtgca 1860 
gaaggctgac ggcattcagg atcgctgtaa cactatttct tttgccactt tggctgaaat 1920 
tcaccacttc catcaaattc gagtgagaga ctttaaatca cagatgcagc atttcttaca 1980 
acaacaaata atatttttcc aaaaagttac ccagaagttg gaagaagctc ttcacaaata 2040 
tgatagtgtt taatgactgg acgttggatt atggactttt tcagttcaag gataatttct 2100 
acagcagaat aaaaactgct gtcaaagagc tattgccagc tatcagtggt ggtacaagga 2160 
cggttttgtg ttcatctgaa acccagctga atttataatt atgtaggaaa taaacagtta 2220 
atatggttat ataatagaaa cagtaccaca cattgtaact aaattatact atgtatgcct 2280 
acactaccat tgtaactttt ggaataatga ttatactatt tgccttattg ctttttgaag 23 40 
tatgggtatt ttagtgcata ctttgtagac ctcaaaaccc atgaagggtc tcaaagaagc 2400 
tggctggata caagcctgct gtggatgcct ttttactctc atagattggg attacctaaa 2460 
ttcaacctat tctctgttta caaactccaa ctagagcagc tatgcgacct tgtgccttta 2520 
.gactcttggt ttttcatttc tccccgtccc ttccccacct ttttaaagta agccacagct 2580 
tttctgattg aaagagtgaa aggccagtgc atataatgac aaactgatga taaccttata 2640 
ttggcagtag ggggtggggg gggcggtggg gtgggacgat cage tgt cat caatttgcac 2700 
agcaagtatt atctcctgat aagatgctgg tgaatgcagg ggagtgagat tcattgctca 2760 
tctttggata tgaagtctgt tagggaagaa acagtgccac tattccctta gatgcaacag 2820 
tagcatagcc tcttcaccca ggcgtcccaa aagcttggcg tgaagatttc agcaaacatg 2880 
tcttacaaca tgaggaggag gagtctaaat cagtcagggg ataaaagtat cgaatcattg 2940 
acaacacaca cttggcttta gttcttagga gggttttgtt tttgtttttg tttttaggtt 3000 
gaagattttc ttttaatatt cagttttttg aaaaaaaatg gatctacact gttaactgat 3060 
tgagactcca ctgtgattca cttgtttact taaaaacttt tcagggatgt ctgtaaattt 3120 
cagtgttaat atgtcatgaa aagtggtgtg gattgatcta aggagggacc agaaataatt 3180 
tttgctattc caaatactga aggaaaaaga taattgattt atactatgtt ttaaaaaaaa 3240 
aaaaggtatt gatgagcccc ccccccccag gacatttaac cttaaaattt attttaaatg 3300 
tattctttta ttattataag ggaaatacag- atggctgata aafeaccaaaa agattcaaaa 3360 
gcagcttaat ttaaaaagca caaagagatt ctggcttaca gtgccccaat ctcaatgttt 3420 
ttatagttgc tgagctaact aatgtgatta ttgagtttac agatttaaaa 'attgtcactg 3480 
ttagagtatc tactgttttt atgaagtcaa acttatgctg cctcagaaat ccctgggtac 3540 
tgaaatggta acatggaagt gaagaggtca ctttgaaata ttggtgagtc acaaagatta 3600 
aagaaaagga tcagtttgca gatactcaga aaaggttatt gaaataatta c , 3651 

<210> 40 
<211> 1989 
<212> DNA. 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7192959CB1 

<400> 40 

ggggtctccg gatatcgggg ctatcaagga cggtctgggg gatgtgggac ttgcgtacgg 60 

ggcccaggga tggtgaagac ggggctgcgg ctggcagaat cagggcaggc agaggctgcg 120 

cgccgcggag gctgctggtc cctgacgcct tgacccttcc tctcccgcag ccacagccgg 180 

gcctggcggg gggaccatgg gcgcctcggt ctccaggggc cgggccgccc gggtccccgc 240 

gccggagccg gaacccgaag aggcgctgga cctgagccaa ctacccccag agctgcttct 3 00 

ggtggtgctg agccacgtcc ccccgcgcac gctgctcggg cgctgccgcc aagtgtgccg 360 

gggctggcga gccctggtgg acggccaggc cctgtggctg ctgatcctgg cccgcgacca 420 

cggcgccacc ggccgcgcgc tgctgcacct cgcccgcagc tgccagtctc ccgcccgtaa 480 

cgccaggcct tgccccctgg gccgcttctg cgcgcgcaga cccatcggac gcaaccttat 540 
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tcgcaacccc tgcggccaag aaggcctccg aaagtggatg gtgcaacacg gtggggacgg 600 

ctgggtggtg gaggaaaaca ggacaaccgt gcctggggcc ccttctcaga cgtgcttcgfc 660 

gacttcattc agctggtgtt gcaagaagca ggtcttggac ctagaggagg agggtctgtg 720 

gccagaactg ctggatagtg gcaggattga gatttgtgtc tctgactggt ggggagcccg 780 

acacgacagc ggctgtatgt acagactcct cgtccaactt ctagacgcca accagactgt 840 

tctagataaa ttctctgctg tgcctgatcc catcccgcag tggaacaaca atgcctgcct 900 

tcacgtcacc cacgtgttct ccaacatcaa gatgggcgtc cgctttgtgt ctttcgaaca 960 

ccggggccag gacacacagt tctgggctgg ccactatgga gcccgtgtga ccaactccag 1020 

tgtgatcgtg cgagtccgtc tgtcctagtc cagcactacc cttcttgcaa gacagcctga 1080 

ctgtgccttc cagggcctgg gaccattggc tgggacccct cattaaccaa ccaagcactt 1140 

gtacctccct ggcatactgg gaattcctgg gtccaatcaa aggccctgac gggccctgtc 1200 

ttcaggttct agaaactacc agaagaagct tcttccatct tatctaccta ctgcagctgc 1260 

tgctttgcgg gggggcccta ctgtactgag gggaaaccca caagtgagca tgggggaatg 1320 

cccatcctgg agaagaattc ttgccctccc ctctcttctc ctcccaacca aaaccctccg 1380 

atccagcccc aggcccctct gtaccatccc ctcccctccg caccactgac ctcttgtctc 1440 

ctattgtttc tgccacaggg acttccttgg tcccttcagg gaagccctca actcatctct 1500 

gaacttagaa tctcatcctt agggctcaga gagtcccagc cctaagggtg tgaacctcac 1560 

ctcctggggc ttccgagcta cagggctggg accaggtcat gcagctgaaa gtctacgtta 1620 

aaaaaaaaaa gtttgggcca ggcacagtgg ctcatgcctg taatcccagc actatgggag 1680 

gctgaggcag gcggatcacc tgaggtcagg agttcaagac cagcctggcc aacatagtga 1740 

aactgtgtct gtgaaggcag gcattgttgt tccactgcg;g gatgggatca ggcacagcag 1800 

agaatttatc tagaacagtc tggttggcgt ctagaagttg gacgaggagt ctgtacatac 1860 

agccgctgtc gtgtcgggct ccccaccagt cagagacaca aatctcaatc ctgccactat 1920 

ccagcagttc tggccacaga ccctcctcct ctaggtccaa gacctgcttc ttgcaacacc 1980 
agctgaatg 1989 

<210> 41 

•<211> 1629 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 6169565CB1 

<400> 41 

cgggcgctag ggcgctggca atgtgtagcg gtcacgctgc gcgtaaccac cacacccgcc 60 
gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg caactgttgg 120 
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 180 
gcaaggcgat taagttgggt aacgccaggt tttcccagtc acgacgttgt aaaacgacgg 240 
ccagtgaatt gaatttaggt gacactatag aagactcaaa ttactccttc ctcccctcta 3 00 
ggtttcctga tttgtaacct ttctaaactc ggtataaagg gggacattaa agtaaagtgc 3 60 
cttatattct tttgcaatac tgcctggcct caggccaagg gcaaccgttg gcctgaaaac 42 0 
ggcacattcg accaccaaat tttaagggat ttagataact tcatttatcg aaatggcaaa 480 
tggcaagaag ttccttcatt tacctttgct cacgtccttc cctctgccaa gattgttccc 540 
cacacctcta atttgctcct taatctctgt tgttggcata aagggcattc agaaaactcc 600 
cctccaaaca ctccccttat attgttcctt tcgggacgtc accctcatac attgcttcct 660 
gttaatccca cattgcccca tgcctcttct aagtagagac ctgctgcaca aattgcgtgg 720 
cttcctccac ctttgggctc ttggccaaag ccatccctac ttatttttat gccaggaacc 7 80 
caaattctcc ctacctgaag ttaaggagcc aacacctgac cttagcatta taactcaaac 840 
aaatcctatt gtttggtcaa cacaaattct gcagtcgtgg cggcccacca ctccccatta 900 
aaatttcatt aaaggatcct tctcattaga tcacagtcca acagtattct gtcagccctg 960 
agaggcttcg gggactcaag cccatcatct ccactgtaga actgccttct tgcctcttct 1020 
ttccttcatc tttctattcc aatacttgct gccaaaaccc gggatgagtc ttggggctga 1080 
gaactcttgc aacccaggaa gcagtgggcc ccgacagctc atccctggct aattcctgga 1140 
tcctgagggt tctctggctt cccgcctagt ccctcaactc ttcctctttt ttaaaagtga 1200 
tttgcatgag gtaattgagt gacaggagag gacttcgagg ctgtggccag ggctacccct 1260 
ggatgggttc tcaaaactcc cggacccaga attttgcctc caaccgctgg atctgggcaa 1320 
tttactttct gtttctcccc aacggctcca gtttgagagg tcctttgctg gttccttcta 1380 
aaacatccaa caccagacac taattagtaa ggtttgcccc cacctgtctc tcctgtgcta 1440 
cccaggagac tcaggtccgc agtcccggtc tcctggggac cattaggacc atcagagacc 1500 
ttgggagggt gacgttcagg gatgcctgac ctttcccctc agtctacctc ctcttgagag 1560 
gatctcccgt ctctgccctt tttctgggga tgcctagtct aacggcagac accctggcct 1620 



49/59 



wo 03/000844 



PCT/US02/19360 



ttccatcgc 



1629 



<210> 42 

<211> 3166 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7494717CB1 



<400> 42 

tcttgcagag 

tgtggaagcg 

gagcagcggc 

atggcggaga 

gcgagcagcg 

ttcctgccgg 

ggcctggctc 

agccactggg 

ctactccgga 

gttgtacctg 

actccttatg 

cacccacctc 

ttctaccgca 

agcccagccc 

ccctatcaca 

aatgaatgta 

tgtccctgtc 

ttctacgagg 

tttggagaga 

cgtcagaaag 

tcatctggga 

ccccttcccc 

gcactgtgta 

ctccgatccc 

gcatctgggg 

gcacctagag 

agctccttag 

gatttatggg 

tctgggtggt 

tcctccctgc 

atttatagtc 

tgatcagtaa 

ttttctagat 

cctgcatggc 

agttcccaac 

cttacagatt 

ggggagactg 

cagcctgtag 

gcagaggggg 

tcccccgtgc 

gcttctctat 

atgtgcttta 

tatgttctcc 

tgtgatgcag 

gccctgccct 

ctgtgcaaag 

ggaactggac 

aggctgggtc 

ttcctcgagc 

gtcatccatt 

gcagagcatg 



cgcctctcgc 

gaggggtggg 

cgctctggtc 

gtccgactga 

ttgctggtgt 

atgtgtgggc 

cgctgcccgg 

accccacgct 

tcaagcggga 

atactgttga 

aagggggttt 

gggtcaaact 

atgggaaagt 

agagcatctc 

atgagcccgg 

tccggcacga 

ctgaacccct 

tggcctgcaa 

agcggggcca 

tgctggagag 

cagagacaga 

cactcaagag 

tctccctcca 

agggtgtggg 

cttcgttcat 

cgagggcctt 

ggat^agtgg 

gatacctgga 

tgtctatttg 

cttccttccc 

actgttctag 

gctttaccta 

gtaagacaga 

accgtaagaa 

acaggcagct 

ttcataaata 

atgctagtcc 

tcctcctcac 

ttggggctgg 

ccccagctgc 

ctgtatttag 

ggtgccctgt 

ccgtacttcc 

ggcctcagtc 

ctgctatgaa 

gggggctggg 

' agggattggt 
tcttcacatg 
tactccaggg 
taagtgttcc 
tctggcatag 



tggttggggc 
gacactctgg 
ggcggacgtg 
ggaggcggca 
tgttggcgtt 
ggcggcggcg 
gctcccgccc 
cagctccgac 
tatcatgtcc 
catgactaag 
cttcctgttc 
gatgacaacg 
ctgcttgagt 
ctcagtgctc 
ctttgaacag 
gaccatcaga 
acgaggggtg 
agatcgcctg 
ctttgactac 
gctccataat 
ccttcatggg 
tcccagcaga 
gactcgaagt 
agtgggggcc 
ccattcatcc 
tggcaaaaac 
aagatggaaa 
agggagcttg 
tctgtcttca 
ccattatttc 
acagaccaaa 
gcacctgagc 
aagtaaatgt 
gctgaaaata 
gctgtgtata 
ctagtcacaa 
ttgttgtatt 
ttcaacccca 
gactctggag 
tcttgtcact 
ctgcagtgat 
aatcct.gggc 
agatttattg 
agtgtccagc 
ggtctctgcc 
cacaaaggag 
gggcttgggg 
aagatcgaaa 
cttagaagaa 
cactttcaag 
cagcctgact 



gggggtgggc 
cccggttctc 
ctgccgagta 
acggcgggcg 
agcggcagcg 
gcagcgggcg 
tcagccgctg 
tgggacggcg 
atttataagg 
attcatgcat 
gtgtttcggt 
ggcaataaca 
attctaggta 
atctctatcc 
gagagacatc 
gttgcagtct 
atggagaagt 
caccttcaag 
cagtccctct 
gagaatgcag 
agcctgaggg 
atcccttccc 
catcctgcaa 
tgttcccggt 
cgtatcaggg 
aaaacaacca 
ttgcaattcc 
gggtgggggc 
gtcttgaagc 
ccacaggcca 
gagaaggaac 
acctttctcc 
gactgggact 
ctgtttgttc 
tgggattaga 
taagggtatt 
ttgttgggct 
gggatttttg 
gctcctcccc 
gtctctgatg 
cctttagctg 
atcaagggaa 
ttatggctcc 
catgcataag 
ttgtggatca 
aatgtcctat 
aacggaagtt 
agggaccctg 
tgctcttggt 
tgacaatcct 
tttatgccct 



ggagccagca 

ggtggtgcgg 

gtcccggaag 

ccggggcggc 

gcggcgggtt 

gggccggggg 

cccacggggc 

agcgcaccgc 

agcctcctcc 

tgatcacagg 

gtccgcccga 

cagtgaggtt 

catggactgg 

agtccctgat 

caggagacag 

gtgacatgat 

cctttctgga 

gccaaactat 

tgatgcgcct 

aaatggactc 

tttagaccct 

cccaccccag 

gatggcaaga 

ctgacctcct 

gccaaggtac 

acacacctct 

aagagggagt 

tgtctgtgac 

agggcttccc 

gcataatttt 

agtggtggag 

cctcccctct 

taaccaaggt 

ccgcaatcac 

gccactacat 

tttcttgggg 

gtccttgtgt 

gggagcaagg 

ttctttctct 

ggtatttgcc 

gttggctcag 

tccatccttc 

cagtgggtat 

ggagaggata 

tgggactccc 

ttgggagggc 

tatcttggat 

cttccaattt 

ctgtgggtcc 

ctccttggcc 

aatcttgagt 



ccgtctgggc 60 
gagcgggcgg 120 
cgaagcagcg 180 
gggccccggg 240 
cgggccgcct 300 
cccggggagc 360 
cgcgctgctt 420 
gccgcagtgt 480 
aggaatgttc 540 
cccatttgac 600 
ctatcccatc 660 
taaccccaac 720 
acctgcctgg 780 
gactgagaac 840 
caaaaactat 900 
ggaaggaaag 960 
gtattacgac 1020 
gcaggaccct 1080 
gggactgata 1140 
tgatagcagt 1200 
gctcccatct 1260 
ggatggagag 1320 
accaagcaag 1380 
tggcactgga 1440 
ctttacagga 1500 
ccacagggcc 1560 
gtgcccaaat 1620 
acttaagcag 1680 
aatgcccttt 1740 
gtttttccta 1800 
tctaggctgc 1860 
ttcctcaccc 1920 
cttggtaaag 1980 
tgatttgaaa 2040 
agaatagtct 2100 
gtggagtaag 2160 
attttcaccc 2220 
gtagccaatg 2280 
tccttccgcc 2340 
tggctttgtt 2400 
aaaaaaaaaa 2460 
ccctttttga 2520 
tggcgattct 2580 
gtgtgtacct 2640 
cttggaggat 2700 
aggaagcaaa 2760 
acccttgaag 2820 
ccctcttcca 2880 
agtgttgtct 2940 
ctgccatagg 3000 
tgaggaaata 3060 
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tatgcacagg agtcaaagag atgtctttat atctgactgt atataaatga agtttttttg 3120 
ttttttttgt tttccttttt ggtgcataaa gtttgttttg gcagaa 3166 

<210> 43 

<211> 399 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7497510CB1 



<400> 43 

ctcctcccct 

gtggcaccat 

agcttcgatc 

ttcctatgga 

ctggtgaatg 

ggaaagaaaa 

tccatggatt 



tggtcccttt 
gcagcttcag 
agaactagca 
aaagggccct 
tgagttattt 
atgtgagaaa 
cgggattgct 



tctccctaaa 
gcctctctct 
cgagacacta 
tgtcaaacct 
gcttacggag 
ttctgcaagt 
ctgagggcca 



atggcaaccc 
cgtttctcct 
tcaaggatct 
acatgacgcg 
gctgcggagg 
tcacctgatt 
tagaaggca 



caggctgagc 
gattctcact 
cctcccaaat 
atggtttttc 
caacagcaac 
ttctaacaag 



cactggctga 
ctctgcctag 
gtatgcgctt 
aactttgaaa 
aactttttga 
aacacagccc 



60 

120 

180 

240 

300 

360 

399 



<210> 44 

<211> 1811 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



7498882CB1 



<400> 44 

ggtttccggg 

cgcagcgctc 

gctgggccct 

cgctgcgaca 

gtgggctcca 

aaagtagtgg 

catttcacca 

gcctttttta 

actgggtggg 

ggaactgcct 

tattctaata 

aagtcttgga 

aggagaagtg 

gaaatacagc 

atcaattttg 

gcaggaaagt 

actgattctc 

tatgtaggag 

catgggccca 

atctaccacc 

gttctgcttg 

aacagctggg 

gagtgtgcaa 

ccttccagta 

ctgtagactt 

cttcagaatt 

tttctaagta 

gtttgcttaa 

agacttattt 

taaaatagat 

aggggggggc 



ccggcgtact 
ggcttcctgg 
ccttgctgct 
cacctgccaa 
gcggttccca 
tgtaccttca 
tcatttacaa 
agtataaaga 
tgcatgatgt 
ctgagaatgt 
ggctctacaa 
ctgcaactac 
gtggccacag 
aaaagatttt 
tcagtcctgt 
acgcccaaga 
catgcaaaat 
gtttctatgg 
tggcagttgc 
acactggtct 
tgggctatgg 
gcaccggctg 
ttgagagcat 
tttcataatg 
tcagcagcaa 
aaaactgccc 
ttcaattaag 
tcatttgtaa 
ttaaattgta 
gctcatattt 

g 



atttcaaggc 
taattcttca 
cgccgccctc 
ctgcacctat 
gcgcgatgtc 
gaagctggat 
ccaaggcttt 
agagggcagc 
gttgggccgg 
gtatgtcaac 
gtatgatcac 
atacatggaa 
tcgaaaaatc 
gcatttgcca 
tcgaaaccaa 
ttttgggctg 
gaaggaagac 
aggctgcaat 
ttttgaagta 
aagagaccct 
cactgactca 
gggtgagaat 
agcagtggca 
atctgcatca 
tctcagaagc 
ttaattttaa 
tgggaatttt 
ttcaaacatg 
ccaatcacaa 
ttaaaataaa 



gcgcgcctcg 
cctcttttct. 

ctgctgcttc 
cttgacctgc 
aactgctcgg 
acagcatatg 
gagattgtgt 
aaggtgacca 
aactgggctt 
acagcacacc 
aactttgtga 
tatgagactc 
ccaaggccca 
acatcttggg 
ggctgtgaag 
gtggaagaag 
tgctttcgtt 
gaagccctga 
tatgatgact 
ttcaacccct 
gcctctggga 
ggctacttcc 
gccacaccaa 
gttgtaaagg 
ttacaaatag 
tatacctttc 
ctggaagatg 
ctatattttt 
gaaaataatg 
gttttaaaaa 



tggtggactc 
cagctccctg 
tctccggcga 
tgggcacctg 
ttatgggacc 
atgaccttgg 
tgaatgacta 
cttactgcaa 
gtttcaccgg 
ttaagaattc 
aagctatcaa 
ttaccctggg 
aacctgcacc 
actggagaaa 
gcggcttccc 
cttgcttccc 
attactcctc 
tgaagcttga 
tcctccacta 
ttgagctgac 
tggattactg 
ggatccgcag 
ttcctaaatt 
ggaattggta 
atttccatga 
aatcggccac 
gtcagctatg 
taaaatcaat 
gcaataatta 
taactgcaaa 



accgctagcc 
cagcatgggt 
cggcgccgtg 
ggtcttccag 
acaagaaaaa 
caattctggc 
caagtggttt 
cgagacaatg 
aaagaaggtg 
tcaggaaaag 
tgccattcag 
agatatgatt 
actgactgct 
tgttcatggt 
ataccttatt 
ctacacaggc 
tgagtaccac 
gttggtccat 
caaaaagggg 
taatcatgct 
gattgttaaa 
aggaactgat 
gtagggtatg 
tattcacaga 
agatatttgt 
tggccatttt 
aagtaataga 
gtgaaaacat 
tcaaaacttt 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1811 
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<210> 45 

<211> 4407 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 5524205CB1 

<400> 45 

cagtgtgctg gaaagtggac agatatactg 
gaaatccctg gcctccatat aagaaaagga 
tgagactttg cactgtaaaa ctttacacat 
gatggatcat caaaatctgt cagaacatgt 
aggacttgaa aattctgctg aagaagaatc 
acgttgtcat gacagttggt ttcctggcag 
aaactatgtt agagtaagag ttccagagac 
aagtactagc tctgataact tgggttcttt 
tttagtagca gaacgtagaa agaaatttca 
aaatcaggtg gttcgtccca aaacttcaag 
gttaactaca gccattttgg aaattaaaga 
ccacaaactc tcaggaaaac aaaactccta 
tttaatccaa ccagaaattc ctaaatacag 
aattttacta aaagctgcat cgcaaagtag 
tagaaaagtg acccctcctg taccacctaa 
ggacaaagaa gaaaggcgac agaaggctag 
tgcttcacga cagaaaagct ttatggaaac 
tattcctatg gagatcacca cggcagaacc 
tatttgtgga cagagtggcc cctcctctga 
acaagcatcc tcagttttgg ggcagtgccg 
cagtgaagag gagcagattt acccttggga 
ttcattatta cagcgttatt ttaaggatag 
ggaaggaggt gtttatgtac aaacctgtgg 
ttacatggaa tcattacgga atgaccaggt 
attcacgtgt ccactctgta ggcagtttgc 
caatgtggaa aataaccctt ggcaacgtcc 
ggaagtggag gagctgcagg gacgaccggg 
agaaatggaa tctgtaatga aagatataaa 
• tagcaagacc ccgggctcac cagacaatga 
caatttagaa cttgaattga ttcatcgagg 
agctggcaaa aggtcttgtt taaatcagct 
ttatagcatt gactctgagt ataatccctg 
tccacagctg ggatatgaag aacaacagcc 
atcccttttg ctcatccaga tcttaatgat 
ctgcattgtg aaggtacttt ttaccctact 
taaatgcagc gaagaagata ggtcagcctg 
atgtgatgca gaaaagtctt acgaagtatt 
aggaaagtta taccatgaag aaggaactca 
gtctcctgaa tccatggaaa aatgcttaca 
cagccttctt cagcaccacc tttttgggga 
attttcagtt cttgccagct gcctgggact 
attcatcagt gcctcctgtc tggattggcc 
gtgttttgag ataaaatcat ttactgaaag 
ccaagagtca aaatggaaat taccacacct 
ttttcagtac taccacagaa aaacctgtag 
tgtttgcctt gtgtgtggta cttttgtatg 
ttactgtgaa tgtgtactgc actctcagaa 
gatcaatgca tcggtaatta tcatcattcg 
gtatttggat gctcatggag aggaagaccg 
ttgtaaggaa agatacaaag ttcttgagca 
caataaaaga tggggtccac attacaatgg 
tatcatcatt ttcgctacga atttattttt 
ttaacacttt tgctgaggga gaaaaagaaa 



catttttaaa acagagtgga aaatttcctg 60 
catcactcca tcctagctat aaaggtctta 120 
tgtgctattc actctgcttt acaagatttt 180 
actctgcatg gttttatatc tgattgaatt 240 
agatgaagag gcatcagtgg gtggaccaga 300 
taacttagtg tcaaacatgc gacactttat 360 
tgctcctgaa gtaaagagag actcacctgc 420 
acaaaattct ggtacagctc aagttttcag 480 
ggaaatcatc aatcgcagta gcagtgaagc 540 
taaatggtct gctcctggtt cagctccaca 600 
aagcatattg tctttgctaa ttaaacttca 660 
ctatcctcct tggcttgatg acatagaaat 72 0 
tcatggagat ggtataactg ccgtggaaag 780 
aatgaacaaa cgcatcattg aagagatatg 840 
aaaagtcact gcagcagaga agaaaacatt 900 
agagaggcag cagaaattgc ttgcggagtt 960 
tgcaatggat gttgattctc ctgagaatga 1020 
acaggtttcc gaggcagtat atgactgtgt 1080 
agatcgacct actggattag ttgtactgtt 1140 
tgacaatgtt gagccaaaaa agttgccgat 1200 
tacctgtgca gccgttcatg atgtgaggct 1260 
ttcatgtctc ttggcagtat caattggctg 1320 
tcacacatta catatagatt gtcataaatc 13 80 
tcttcagggc ttctcggtgg acaaaggaga 1440 
taacagtgtt cttccatgtt atcctggaag 1500 
tagcaacaaa agcatacaag atctcataaa 1560 
agctttccca tcagaaacaa atttaagtaa 1620 
aaataccact cagaagaaat atagagacta 1680 
ttttctcttt atgtactctg ttgctagaac 1740 
aggcaatttg tgttcaggtg gtgcaagcac 1800 
gtttcatgta ttagccttgc acatgcggct 1860 
gagaaagctc acccagttag aagagatgaa 1920 
tgaggttcca attctttatc atgatgtaac 1980 
gccacaaccc ttacgcaaag accactttac 2040 
gtacacacag gctcttgcag cactctcagt 2100 
gaaacacgcg ggagctctca aaaagagtac 2160 
actgagcttt gtgataagtg aactatttaa 2220 
ggaatgtgca atggttaacc ctattgcttg 2280 
ggacttctgc ttaccttttc tcagaatcac 2340 
agatttacct agctgccagg aagaagaaga 2400 
tctgccaacg ttttaccaaa cagaacatcc 2460 
agttccagca tttgatatta taactcagtg 2520 
acatgcagaa caaggaaagg ccttgcttat 2580 
actacagttg cctgagaatt ataacaccat 2640 
tgtctgcacc aaggttccta aagatcctgc 2700 
cctgaaagga ctttgctgca agcaacaaag 2760 
ctgtggtgca ggaacaggta ttttcctttt 2820 
aggtcaccgc ttctgcctct ggggttccgt 2880 
ggatcttagg cgaggcaaac ctctctacat 2940 
acagtggatt tctcatactt ttgatcacat 3000 
gctgtgactc tccacctcag cattgcatcg 3060 
caacaataag ctttaactta atttggggga 3120 
acatacatta tgaagccttt ccaaaattag 3180 
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gtgcttggta atcacgttaa 
aacaagttaa attattcttt 
cttgttcaca aactcattca 
tacttagagt gggtgtgatt 
atttgcatct gccaagcata 
aatttttctg tatgtaatta 
gtaaaagttg gtctaatagt 
aagtttaaag ctttgctgaa 
aggtaaataa ttgaaatagc 
cattttgggg ggcatatgtg 
ttggatgaat tactagaggt 
tacactgggc agatgttgat 
ttatatgatt gtattagaaa 
ttcctttcag ctcaagaggt 
ctgctcggtt tcctaaaagg 
acgcttctgc attcttctta 
aaatcagtaa aaatgggaat 
aaaagcatgt gacatattgc 
attatttaat atatagtgct 
atataaaatt acatttttac 
acataatgtc agatccatgt 

<210> 46 
<211> 2177 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<223> Incyte ID No: 7102342CB1 

. <400> 46 

agcggaattc ccgacttccc aacggcttcc cgctggcagc cccgaagccg caccatgttc 60 
cgcctctggt tgctgctggc cgggctctgc ggcctcctgg cgtcaagacc cggttttcaa 120 
aattcacttc tacagatcgt aattccagag aaaatccaaa caaatacaaa tgacagttca 180 
gaaatagaat atgaacaaat atcctatatt attccaatag atgagaaact gtacactgtg 240 
caccttaaac aaagatattt tttagcagat aattttatga tctatttgta caatcaagga 300 
tctatgaata cttattcttc agatattcag actcaatgct actatcaagg aaatattgaa 360 
ggatatccag attccatggt cacactcagc acgtgctctg gactaagagg aatactgcaa 420 
tttgaaaatg tttcttatgg aattgagcct ctggaatctg cagttgaatt tcagcatgtt 480 
ctttacaaat taaagaatga agacaatgat attgcaattt ttattgacag aagcctgaaa 540 
gaacaaccaa tggatgacaa catttttata agtgaaaaat cagaaccagc tgttccagat 600 
ttatttcctc tttatctaga aatgcatatt gtggtggaca aaactttgta tgattactgg 660 
ggctctgata gcatgatagt aacaaataaa gtcatcgaaa ttgttggcct tgcaaattca 720 
atgttcaccc aatttaaagt tactattgtg ctgtcatcat tggagttatg gtcagatgaa 780 
aataagattt ctacagttgg tgaggcagat gaattattgc aaaaattttt agaatggaaa 840 
caatcttatc ttaacctaag gcctcatgat attgcatatc tactaattta tatggattat 900 
cctcgttatt tgggagcagt gtttcctgga acaatgtgta ttactcgtta ttctgcagga 960 
gttgcattgc aatgtggacc tgcaagctgt tgtgattttc gaacttgtgt actgaaagac 1020 
ggagcaaaat gttataaagg actgtgctgc aaagactgtc aaattttaca atcaggcgtt 1080 
gaatgtaggc cgaaagcaca tcctgaatgt gacatcgctg aaaattgtaa tggaagctca 1140 
ccagaatgtg gtcctgacat aactttaatc aatggacttt catgcaaaaa taataagttt 1200 
atttgttatg acggagactg ccatgatctc gatgcacgtt gtgagagtgt atttggaaaa 1260 
ggttcaagaa atgctccatt tgcctgctat gaagaaatac aatctcaatc agacagattt 1320 
gggaactgtg gtagggatag aaataacaaa tatgtgttct gtggatggag gaatcttata 13 80 
tgtggaagat tagtttgtac ctaccctact cgaaagcctt tccatcaaga aaatggtgat 1440 
gtgatttatg ctttcgtacg agattctgta tgcataactg tagactacaa attgcctcga 1500 
acagttccag atccactggc tgtcaaaaat ggctctcagt gtgatattgg gagggtttgt 1560 
gtaaatcgtg aatgtgtaga atcaaggata attaaggctt* cagcacatgt ttgttcacaa 1620 
cagtgttctg gacatggagt gtgtgattcc agaaacaagt gccattgttc gccaggctat 1680 
aagcctccaa actgccaaat acgttccaaa ggattttcca tatttcctga ggaagatatg 1740 
ggttcaatca tggaaagagc atctgggaag actgaaaaca cctggcttct aggtttcctc 1800 
attgctcttc ctattctcat tgtaacaacc gcaatagttt tggcaaggaa acagttgaaa 1860 



tggtataatt tttttttttt 
agtggtcatt ttttaagtgc 
gaaatgattc tcccaacaat 
tatttgacat tttactgctt 
atgcatcttt tttcctctgc 
gaaaaaaatg taaaacatga 
agaactttaa aattttttct 
aactgaattc attctcagga 
tgtaaaataa gtagatagct 
tggttggggg gtccttaaaa 
aataacaaat cttactataa 
tccgtgcatg cccacctttt 
tgctcagact tccccagaaa 
tcagctatat tgtatttgtg 
aaaaaaaagg cgcagtggtg 
gaaactgctg tgaaaaacaa 
gattgagcta aaacccactc 
tacaaagatt ttttttccta 
atcaagcaat ccctggtact 
atgtaaaaat aaactaaaca 
tctatac 



aatatctgga gaacattaat 3240 
acaattaata agaagcacaa 3300 
gcatatcagc tattcattga 3360 
ctttctgtct gtgtgtttta 3420 
cattcttgtg ttgattggag 3480 
tttatgtgaa atactgtata 3540 
tattgtgagg aatctgttaa 3600 
atttcataaa tcttctcccc 3660 
gctgttaata taatacagta 3720 
atcaaaattt gccatttcag 3780 
aatcaagagg tttaagaaca 3840 
attaccaaac aaggttttgt 3900 
tgaaccataa attttggaac 3960 
cagtgtaatc actactattt 4020 
atgaccctca tgaatgagcc 4080 
tttatgtttg cagggtttaa 4140 
tatgagaagg aagattactg 4200 
aatgattcag taatfcgaatg 4260 
ttggacttcc atggcttgtt 4320 
aatctaatga taaaatataa 43 80 

4407 
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aagtggttcg ccaaggaaga ggaattccca 
cagacatatg ccagccaatc cagctcagaa 
agatcagaaa gcagcagtca agctgatact 
acacaaagca gtagtaacta gtgattcctt 
gctaagaaat gaaaattctg tctttccttc 
gagtgtggat ctaaaaa 

<210> 47 

<211> 703 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 4169939CB1 



agtagcgaat ctaaatcgga aggtagcaca 1920 
ggcagcactc agacatatgc cagccaaacc 1980 
agcaaatcca aatcacagga cagtacccaa 2040 
cagaaggcaa cggataacat cgagagtctc 2100 
cgtggtcaca gctgaaagaa acaataaatt 2160 

2177 



<400> 47 

atgatgctcc ggctgctcag ttccctcctc cttgtggccg ttgcctcagg ctatggccca 60 

ccttcctctc actcttccag ccgcgttgtc catggtgagg atgcgatccc catcaactct 120 

gaggagctgt ttgtgcatcc actctggaac cgctcgtgtg tggcctgtgg caatgacatc 180 

gccctcatca agctctcacg cagcgcccag ctgggagatg ccgtccagct cgcctcactc 240 

cctcccgctg gtgacatcct tcccaacaag acaccctgct acatcaccgg ctggggccgt 300 

ctctatacca atgggccact cccagacaag ctgcagcagg cccggctgcc cgtggtggac 360 

tataagcact gctccaggtg gaactggtgg ggttccaccg tgaagaaaac catggtgtgt 420 

gctggagggt acatccgctc cggctgcaac ggtgactctg gaggacccct caactgcccc 480 

acagaggatg gtggctggca ggtccacggt gtgaccagct ttgtttctgg ctttggctgc 540 

aacttcatct ggaagcccac ggtgttcact cgagtctccg ccttcatcga ctggattgag 600 

gagaccatag caagccacta gaaccaaggc ccagctggca gtgctgatcg atcccacatc 660 

ctgaataaag aataaagatc tctcagaaaa aaaaaaaaaa aaa 703 

<210> 48 

<211> 1295 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6539977CB1 

<400> 48 

ggggaatgcc agttctggca ccaaccttcc tgctccctgc tggggcctct gctcccccat 60 
ctctcaggag tcgaaagtga gaaagcaaga catcaaggag ggacctgtgc cctgctccac 120 
atcctcccac ctgccgcccg cagagcctgc aggccccgcc cccctcgtct ctggtcccta 180 
cctctctgct gtgtcttcat gtccctgagg gtcttgggct ctgggacctg gccctcagcc 240 
cctaaaatgt tcctcctgct gacagcactt caagtcctgg ctatagccat gacacggagc 300 
caagaggatg agaacaagat aattggtggc tatacgtgca cccggagctc ccagccgtgg 360 
caggcggccc tgctggcggg tcccaggcgc cgcttcctct gcggaggcgc cctgctttca 420 
'ggccagtggg tcatcactgc tgctcactgc ggccgcccga tccttcaggt tgccctgggc 480 
aagcacaacc tgaggaggtg ggaggccacc cagcaggtgc tgcgcgtggt tcgtcaggtg 540 
acgcacccca actacaactc ccggacccac gacaacgacc tcatgctgct gcagcfcacag 600 
cagcccgcac ggatcgggag ggcagtcagg cccattgagg tcacccaggc ctgtgccagc 660 
cccgggacct cctgccgagt gtcaggctgg ggaactatat ccagccccat cgccaggtac 720 
cccgcctctc tgcaatgcgt gaacatcaac atctccccgg atgaggtgtg ccagaaggcc 780 
tatcctagaa ccatcacgcc tggcatggtc tgtgcaggag ttccccaggg cgggaaggac 840 
tcttgtcagg gtgactctgg gggacccctg gtgtgcagag gacagctcca gggcctcgtg 900 
tcttggggaa tggagcgctg cgccctgcct ggctaccccg gtgtctacac caacctgtgc 960 
aagtacagaa gctggattga ggaaacgatg cgggacaaat gatggtcttc acggtgggat 1020 
ggacctcgtc agctgcccag gccctcctct ctctactcag gacccaggag tccaggcccc 1080 
agcccctcct ccctcagacc caggagtcca ggcccccagc ccctcctccc tcagacccgg 1140 
gagtccaggc ccccagcccc tcctccctca gacccaggag tccaggcccc agcccctcct 1200 
ccctcagacc cgggagtcca ggcccccagc ccctcctccc tcagacccag gagtccaggc 1260 
cccagtccct cctccctcag acccaggagt ccagg 1295 
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<210> 49 
<211> 575 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7675588CB1 

<400> 49 

ccgggaggtc tgggtgttgg ctggtccatt ccaggacagc tacatctttg gcagacctgg 60 
tgctgacaga accagctctg atcaggaggc cagtacttcc taaaatggga ctctcaggac 120 
ttctgccaat cctggtacca ttcatccttt tgggggacat ccaggaacct gggcacgctg 180 
aaggcatcct tggcaagccg tgtcccaaaa tcaaagtgga atgcgaagtg gaagaaatag 240 
accagtgtac caaacccaga gattgcccag aaaacatgaa gtgttgcccg ttcagccgtg 300 
gaaagaaatg tttagacttc agaaaggtca gccttacttt ataccataag gaggagcttg 3 60 
aataacctcc aggatttggc tcataatcca ggcctctctc cacgtgtgcc tgattgatgc 420 
tccaaattgg cttccacggg ccaaaccttg gctgttccag aaactgaacc ccaggaattg 480 
cttacacact ttcttccagc gtagcatctc ttcaaacaca atgctcttcc ccttgaccac 540 
ttctcagtat gaaactctat gtcttcgggt cgttg 575 

<210> 50 

<211> 1062 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feat\ire 

<223> Incyte ID No: 6244077CB1 

<220> 

<221> unsure 
<222> 33-34 

<223> a, t, c, g, or other 

<400> 50 

ataaatttag agagacgtat ggccctcgag canngaattc ggcacgaggc ctccagtccc 60 
tgagaattgg tactacgaaa aggtgaactc ctgggcagaa tcttgcctag agcttgcgga 120 
gtccagccag gcccctgctg aagggcccca gaccaccggc cacttctccc ccgtccatct 180 
gaccagctgg gcccctgcgc ccacctggcc tccacgttcc ctctcctctc acccacaccc 240 ' 
ctggccatgg ctaactacta cgaagtgctg ggcgtgcagg ccagcgcttc cccggaggac 300 
atcaagaaag cctaccgcaa gctggccctt cgttggcacc ccgacaagaa ccctgacaat .360 
aaggaggagg cggagaagaa gttcaagctg gtgtctgagg cctatgaggt tctgtctgac 420 
tccaagaaac gctccctgta tgaccgtgct ggctgtgaca gctggcgggc tggtggcggg 480 
gccagcacgc cctaccacag ccccttcgac accggctaca ccttccgtaa ccctgaggac 540 
atcttccggg agtttttcgg tggcctggac cctttctcct ttgagttctg ggacagccca 600 
ttcaatagtg accgtggtgg ccggggccat ggcctgaggg gggccttctc ggcaggcttt 660 
ggagaatttc cggccttcat ggaggccttc tcatccttca acatgctggg ctgcagcggg 720 
ggcagccaca ccaccttctc atccacctcc ttcgggggct ccagttctgg cagctcgggg 780 
ttcaagtcgg tgatgtcgtc caccgagatg atcaatggcc acaaggtcac caccaagcgc 840 
atcgtggaga acgggcagga gcgcgtggag gtggaggaag acgggcagct caagtcggtg 900 
actgtgaacg gcaaggagca gctcaaatgg atggacagca agtaggcgct ggccacccgg 960 
ccctgccttc ccaccaccac caccgtgcat ggggcagcaa acacgtgggg ccgcagacat 1020 
agcctgatgg ttaataaatg tgccaagtga gttcatggca aa 1062 

<210> 51 
<211> 1029 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7498404CB1 . 
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<400> 51 

ggcttccggc atctggctca gttccgccat ggcctccttg gaagtcagtc gtagtcctcg 60 
caggtctcgg cgggagctgg aagtgcgcag tccacgacag aacaaatatt cggtgctttt 120 
acctacctac aacgagcgcg agaacctgcc gctcatcgtg tggctgctgg tgaaaagctt 180 
ctccgagagt ggaatcaact atgaaattat aatcatagat gatggaagcc cagatggaac 240 
aagggatgtt gctgaacagt tggagaagat ctatgggtca gacagaattc ttctaagacc 300 
acgagagaaa aagttgggac taggaactgc atatattcat ggaatgaaac atgccacagg 360 
aaactacatc attattatgg atgctgatct ctcacaccat ccaaaattta ttcctgaatt 420 
tattaggaag caaaaggagg gtaattttga tattgtctct ggaactcgct acaaaggaaa 480 
tggaggtgta tatggctggg atttgaaaag aaaaataatc agattatacc gaaaagaagt 540 
tctagagaaa ttaatagaaa aatgtgtttc taaaggctac gtcttccaga tggagatgat 600 
tgttcgggca agacagttga attatactat tggcgaggtt ccaatatcat ttgtggatcg 660 - 
tgtttatggt gaatccaagt tgggaggaaa tgaaatagta tctttcttga aaggattatt 720 
gactcttttt gctactacat aaaagaaaga tactcattta tagttacgtt catttcaggt 780 
taaacatgaa agaagcctgg ttactgattt gtataaaatg tactcttaaa gtataaaata 840 
taaggtaagg taaatttcat gcatcttttt atgaagacca cctattttat atttcaaatt 900 
aaataatttt aaagttgctg gcctaatgag caatgttctc aattttcgtt ttcattttgc 960 
tgtattgaga cctataaata aatgtatatt tttttttgca aaaaaaaaaa aaaaaaaaaa 1020 
aaaaaaaaa 1029 

<210> 52 

<211> 905 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7391748CB1 

<400> 52 

tggggctcaa aatataaact caggctattt atcaacttaa tctggggaag caaacctgaa 60 

ggcaagtacc accctgtcat ccctagctca gagctgctga gaaagaggat acagctgagc 120 

cccagggccc tcccatcccc tcgattctgg ttagctgcag tcttgccctc cccgtgctgt 180 

ctgcctaccc tgcagagctg gtggaccata gctcctgcag cccagaccta cctcttgctt 240 

ttgcagcaat ataaatgtca ccctgggcgc ccacaatatc cagagacggg aaaacaccca 300 

gcaacacatc actgcgcgca gagccatccg ccaccctcaa tataatcagc ggaccatcca 360 

gaatgacatc atgttattgc agctgagcag aagagtcaga cggaatcgaa acgtgaaccc 420 
agtggctctg cctagagccc aggagggact gagacccggg acgctgtgca ctgtggccgg 480 

ctggggcagg gtcagcatga ggaggggaac agatacactc cgagaggtgc agctgagagt 540 

gcagagggat aggcagtgcc tccgcatctt cggttcctac gacccccgaa ggcagatttg 600 

tgtgggggac cggcgggaac ggaaggctgc cttcaagggg gattccggag gccccctgct 660 

gtgtaacaat gtggcccacg gcatcgtctc ctatggaaag tcgtcagggg ttcctccaga 720 

agtcttcacc agggtctcaa gtttcctgcc ctggataagg acaacaatga gaagcttcaa 780 

actgctggat cagatggaga cccccctgtg actgactctt cttctcgggg acacaggcca 840 

gctccacagt gttgccagag ccttaataaa cgtccacaga gtataaataa aaaaaaaaaa 900 
aaaaa 905 

<210> 53 

<211> 2667 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7499780CB1 

<400> 53 

agtcctgtct cccgcccgcc ggccgagccg cgcccgtgcc ccgcctcccg tgcgcccggg 60 
acaatcctcg ccttgtctgt ggcgccggca tctggagctt tctgtagcct ccggatacgc 120 
ctttttttca gggcgtagcc ccagccaagc tgctccccgc ggcggccgca caagcagccc 180 
gagcgccccc tttccagagc tcccctccgg agctgggatc caggcgcgta gcggagatcc- 240 
caggatcctg ggtgctgtct gggcccgctc cccaccatga cctcctcggg gcctggaccc 300 
cggttcctgc tgctgctgcc gctgctgctg ccccctgcgg cctcagcctc cgaccggccc 360 
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cggggccgag acccggtcaa cccagagaag ctgctggtga tcactgtggc cacagctgaa 420 
accgaggggt acctgcgttt cctgcgctct gcggagttct tcaactacac tgtgcggacc 480 
ctgggcctgg gagaggagtg gcgagggggt gatgtggctc gaacagttgg tggaggacag 540 
aaggtccggt ggttaaagaa ggaaatggag aaatacgctg accgggagga tatgatcatc 600 
atgtttgtgg atagctacga cgtgattctg gccggcagcc ccacagagct gctgaagaag 660 
ttcgtccaga gtggcagccg cctgctcttc tctgcagaga gcttctgctg gcccgagtgg 720 
gggctggcgg agcagtaccc tgaggtgggc acggggaagc gcttcctcaa ttctggtgga 780 
ttcatcggtt ttgccaccac catccaccaa atcgtgcgcc agtggaagta caaggatgat 840 
gacgacgacc agctgttcta cacacggctc tacctggacc caggactgag ggagaaactc 900 
agccttaatc tggatcataa gtctcggatc tttcagaacc tcaacggggc tttagatgaa 960 
gtggttttaa agtttgatcg gaaccgtgtg cgtatccgga acgtggccta cgacacgctc 1020 
cccattgtgg tccatggaaa cggtcccact aagctgcagc tcaactacct gggaaactac 1080 
gtccccaatg gctggactcc tgagggaggc tgtiggcttct gcaaccagga ccggaggaca 1140 
ctcccggggg ggcaggaggt cttccatgaa ccccacatcg ctgactcctg gccgcagctc 1200 
caggaccact tctcagctgt gaagctcgtg gggccggagg aggctctgag cccaggcgag 1260 
gccagggaca tggccatgga cctgtgtcgg caggaccccg agtgtgagtt ctacttcagc 1320 
ctggacgccg acgctgtcct caccaacctg cagaccctgc gtatcctcat tgaggagaac 1380 
aggaaggtga tcgcccccat gctgtcccgc cacggcaagc tgtggtccaa cttctggggc 1440 
gccctgagcc ccgatgagta ctacgcccgc tccgaggact acgtggagct ggtgcagcgg 1500 
aagcgagtgg gtgtgtggaa tgtaccatac atctcccagg cctatgtgat ccggggtgat 1560 
accctgcgga tggagctgcc ccagagggat gtgttctcgg gcagtgacac agacccggac 1620 
atggccttct gtaagagctt tcgagacaag ggcatcttcc tccatctgag caatcagcat 1680 
gaatttggcc ggctcctggc cacttccaga tacgacacgg agcacctgca ccccgacctc 1740 
tggcagatct tcgacaaccc cgtcgactgg aaggagcagt acatccacga gaactacagc 1800 
cgggccctgg aaggggaagg aatcgtggag cagccatgcc cggacgtgta ctggttccca 1860 
ctgctgtcag aacaaatgtg tgatgagctg gtggcagaga tggagcacta cggccagtgg 1920 
tcaggcggcc ggcatgagga ttcaaggctg gctggaggct acgagaatgt gcccaccgtg 1980 
gacatccaca tgaagcaggt ggggtacgag gaccagtggc tgcagctgct gcggacgtat 2040 
gtgggcccca tgaccgagag cctgtttccc ggttaccaca ccaaggcgcg ggcggtgatg 2100 
aactttgtgg ttcgctaccg gccagacgag cagccgtctc tgcggccaca ccacgactca 2160 
tccaccttca ccctcaacgt tgccctcaac cacaagggcc tggactatga gggaggtggc 2220 
tgccgcttcc tgcgctacga ctgtgtgatc tcctccccga ggaagggctg ggcactcctg 2280 
caccccggcc gcctcaccca ctaccacgag gggctgccaa cgacctgggg cacacgctac 2340 
atcatggtgt cctttgtcga cccctgacac tcaaccactc tgccaaacct gccctgccat 2400 
tgtgcctttt tagggggcct ggcccccgtc ctgggagttg ggggatgggt ctctctgtct 2460 
ccccacttcc tgagttcatg ttccgcgtgc ctgaactgaa tatgtcacct tgctcccaag 2520 
acacggccct ctcaggaagc tcccggagtc cccgcctctc tcctccgccc acaggggttc 2580 
gtgggcacag ggcttctggg gactccccgc gtgataaatt attaatgttc cgcagtctca 2640 
ctctgaataa aggacagttt gtaaaaa 2667 

<210> 54 
<211> 2977 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<223> Incyte ID No:- 7499881CB1 

<400> 54 

gtgcggagtg ccgagcggcc tcacccccaa ccgtcggccc agtcggacgg ttccgaggcg 60 
ttgccgggag ccgggcgcgg ctctgtgtgg actcggagaa acgcggggcg tctgcctgag 120 
cccgcttttc tacaagatgt ggggattttt gaagcgccct gtagtggtga cggctgacat 180 
caacttgagc cttgtggccc tgactgggat ggggttactg agccggctgt ggcgactcac 240 
ctacccgcgg gctgtggtgt tatttaggag gattcgatgg caattttttg tggaacagaa 300 
ttggagcaga atacagtagc aacgtgcctg tgtggtccct gcgcctgctg ccagcactcg 360 
cgggggcctt gtcggtcccc atggcctacc agatagtgtt ggagctccac ttttctcatt 420 
gtgccgccat gggagctgct ctgttgatgc ttatcgagaa tgctctcatc actcagtcaa 480 
ggctaatgct tttggaatca gtgttaatat ttttcaatct attggccgtg ttgtcctacc 540 
tgaagttctt caactgccaa aagcacagcc ctttttctct gagctggtgg ttctggctaa 600 
cactgacagg ggtcgcttgt tcctgtgcag tgggcatcaa gtacatgggt gtgttcacgt 660 
acgtgctcgt gctgggtgtt gcagctgtcc atgcctggca cctgcttgga gaccagactt 720 
tgtccaatgt aggtgctgat gtccagtgct gcatgaggcc ggcctgtatg gggcagatgc 780 
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ggatgtcaca gggggtctgt gtgttctgtc acttgctcgc ccgagcagtg gctttgctgg 840 
tcatcccggt cgtcctgtac ttactgttct tctacgtcca cttgattcta gtcttccgct 900 
cfcgggcccca cgaccaaatc atgtccagtg ccttccaggc cagcttagag ggaggactag 960 
ctcggatcac ccagggtcag ccactggagg tggcctttgg gtcccaggtc actctgagga 1020 
acgtctttgg gaaacctgtg ccctgctggc ttcattccca ccaggacacc taccccatga 1080 
tatatgagaa cggccgaggc agctcccacc agcaacaggt gacctgttac cccttcaaag 1140 
acgtcaataa ctggtggatt gtaaaggatc ccaggaggca ccagctggtg gtgagcagcc 1200 
ctccgagacc tgtgaggcac ggggacatgg tgcagctggt ccacggcatg accacccgct 1260 
ccctgaacac gcatgatgtt gcagcccccc tgagccccca ttcacaggag gtctcctgct 1320 
acattgacta taacatctcc atgcccgccc agaacctctg gagactggaa attgtgaaca 1380 
gaggatctga cacagacgtc tggaagacca tcctctcaga ggtccgcttt gtgcacgtga 1440 
acacttccgc tgtcttaaag ctgagcgggg ctcacctccc tgactggggg tatcggcaac 1500 
tggagatcgt cggggagaag ctgtcccggg gctaccacgg gagcacggtg tggaacgtgg 1560 
aggagcaccg atacggcgcg agccaggagc agagggagcg ggaacgggag ctgcactcac 1620 
ctgcgcaggt ggacgtcagc aggaacctca gcttcatggc gagattctcg gagctgcagt 1680 
gcraggatgct ggcgctgaga agtgatgact cggaacacaa gtacagctcc agcccactgg 1740 
agtgggtcac cctggacacc aatattgcct actggctgca ccccaggacc agcgctcaga 1800 
tccacctact tggaaacata gtgatctggg tttcgggcag cctcgctctg gccatctacg 1860 
ccctgctgtc cttgtggtac ctgctccgac ggcgaagaaa tgtccatgac ctccctcagg 1920 
atgcctggct gcgctgggtg ctggctgggg cgctgtgtgc cggtggctgg gcagtgaact 198 0 
acctcccgtt cttcctgatg gagaagacac tcttcctcta ccactacctg cccgcactca 2040 
ccttccaaat ccttctgctc cctgtggtcc tgcagcacat cagcgaccac ctgtgcaggt 2100 
cccagctcca gaggagcatc ttcagcgccc tggtggtggc ctggtactcc tccgcgtgcc 2160 
acgtgtccaa cacgctgcgc ccactcacct acggggacaa gtcactctcg ccacatgaac 2220 
tcaaggccct tcgctggaaa gacagctggg acatcttgat ccgaaaacac tagaacaaga 2280 
gtgtggcaaa gaacacccgt gctggggtcg ggacgaggtt gaagggtctt ggtcaatgta 2340 
cgtaatgagc agggtgggcc ccacgctggg aggacacggg ctgggctgag cagggcctct 2400 
agtggaacac atgggggtct cattgaaaag ctctctgatg agcacctcct tttgtgcaaa 2460 
gttaattttt tctcgacaat aaagatattc cgtgtcttca cccctgaact aagacacagg 2520 
gagtatttca gaggccagcg taggagtcat cgacaacgaa aagccgagaa cccagggcca 2580 
gcagtggagc ctcagcagac cagggcctgg tccttgctaa ttgctgcagg gtggagtttg 2640 
atctggcaga cccgatcctc cttcatgaac acccagcaac ctgagcaagt cccggccctg 2700 
ccctcagcga gcccggcagg cgtcccggga cagctcagtg ttggagggcc acctgaacca 2760 
cgagccaggg ctggggcttg catgtcattg tctatgacag cgtcaagact ggcccttggc 282 0 
accgtgctgt gtggaaaccc tcccctctga gactccactg agacgtggct gagtgaaatc 2880 
ttcctcgtca gtggtcaagg tgtgtcatcc atacagctcc atgcctttgt cttttttaaa 2940 
tgtaattaaa aaaggaacca actggaaaaa aaaaaaa 2977 

<210> 55 
<211> 729 

<212> DNA • ^ ■ 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7488579CB1 

<400> 55 

atgctcctcc ttgctcccca gatgctgaat ctgctgctgc tggcgctgcc cgtcctggcg 60 

agccgcgcct acgcggcccc tgccccaggc caggccctgc agcgagtggg catcgttggg 120 

ggtcaggagg cccccaggag caagtggccc tggcaggtga gcctgagagt ccacggccca 180 

tactggatgc acttctgcgg gggctccctc atccaccccc agtgggtgct gaccgcagcg 240 

cactgcgtgg gaccggacgt caaggatctg gccgccctca gggtgcaact gcgggagcag 300 

cacctctact accaggacca gctgctgccg gtcagcagga tcatcgtgca cccacagttc 360 

tacatcatcc agaccggggc ggacatcgcc ctgctggagc tggaggagcc cgtgaacatc 42 0 

tccagccaca tccacacggt cacgctgccc cctgcctcgg agaccttccc cccggggatg 480 

ccgtgctggg tcactggctg gggcgacgtg gacaataatg tgcacctgcc gccgccatac 540 

ccgctgaagg aggtggaagt ccccgtagtg gaaaaccacc tttgcaacgc ggaatatcac 600 

accggcctcc atacgggcca cagctttcaa atcgtccgcg atgacatgct gtgtgcgggg 660 

agcgaaaatc acgactcctg ccagggtgac tctggagggc ccctggtctg caaggtgaat 720 
ggcacctaa 729 

<210> 56 . 
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<211> 1879 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7510521CB1 

<400> 56 

ggtttccggg ccggcgtact atttcaaggc 

cgcagcgctc ggcttcctgg taattcttca 
gctgggccct ccttgctgct cgccgccctc 
cgctgcgaca cacctgccaa ctgcacctat 
gaccacaaga aaaaaaagta gtggtgtacc 
ttggcaattc tggccatttc accatcattt 
actacaagtg gtttgccttt tttaagtata 
gcaacgagac aatgactggg tgggtgcatg 
ccggaaagaa ggtgggaact gcctctgaga 
attctcagga aaagtattct aataggctct 
tcaatgccat tcagaagtct tggactgcaa 
tgggagatat gattaggaga agtggtggcc 
caccactgac tgctgaaata cagcaaaaga 
gaaatgttca tggtatcaat tttgtcagtc 
gctaictcatt tgcttctatg ggtatgctag 
ctcagacccc aatcctaagc cctcaggagg 
gtgaaggcgg cttcccatac cttattgcag 
aagaagcttg cttcccctac acaggcactg 
ttcgttatta ctcctctgag taccactatg 
ccctgatgaa gcttgagttg gtccatcatg 
atgacttcct ccactacaaa aaggggatct 
acccctttga gctgactaat catgctgttc 
ctgggatgga ttactggatt gttaaaaaca 
acttccggat ccgcagagga actgatgagt 
caccaattcc taaattgtag ggtatgcctt 
taaaggggaa ttggtatatt cacagactgt 
aaatagattt ccatgaagat atttgtcttc 
cctttcaatc ggccactggc catttttttc 
aagatggtca gctatgaagt aatagagttt 
attttttaaa atcaatgtga aaacatagac 
ataatggcaa taattatcaa aacttttaaa 
taaaaataaa aaaaaaaaa 



gcgcgcctcg tggtggactc accgctagcc 60 
cctcttttct cagctccctg cagcatgggt 120 
ctgctgcttc tctccggcga cggcgccgtg 180 
cttgacctgc tgggcacctg ggtcttccag 240 
ttcagaagct ggatacagca tatgatgacc 300 
acaaccaagg ctttgagatt gtgttgaatg 360 
aagaagaggg cagcaaggtg accacttact 420 
atgtgttggg ccggaactgg gcttgtttca 480 
atgtgtatgt caacacagca caccttaaga 540 
acaagtatga tcacaacttt gtgaaagcta 600 
ctacatacat ggaatatgag actcttaccc 660 
acagtcgaaa aatcccaagg cccaaacctg 720 
ttttgcattt gccaacatct tgggactgga 780 
ctgttcgaaa ccaagcatcc tgtggcagct 840 
aagcgagaat ccgtatacta accaacaatt 900 
ttgtgtcttg tagccagtat gctcaaggct 960 
gaaagtacgc ccaagatttt gggctggtgg 1020 
attctccatg caaaatgaag gaagactgct 1080 
taggaggttt ctatggaggc tgcaatgaag 1140 
ggcccatggc agttgctttt gaagtatatg 1200 
accaccacac tggtctaaga gaccctttca 1260 
tgcttgtggg ctatggcact gactcagcct 1320 
gctggggcac cggctggggt gagaatggct 1380 
gtgcaattga gagcatagca gtggcagcca 1440 
ccagtatttc ataatgatct gcatcagttg 1500 
agactttcag cagcaatctc agaagcttac 1560 
agaattaaaa ctgcccttaa ttttaatata 1620 
taagtattca attaagtggg aattttctgg 1680 
gcttaatcat ttgtaattca aacatgctat 1740 
ttatttttaa attgtaccaa tcacaagaaa 1800 
atagatgctc atatttttaa aataaagttt 1860 

1879 
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